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ie INTRODUCTION 


Database technology 1s rapidly becoming an extremely 
important aspect of data processing. Commercial database 
management systems have only been available since the 
1940's. Today, many thousands of orqanizations Le4:6o.% 
corporations, universities, qovernments) are critically 
dependent on the efficient and reliable operation of these 
systems. Fach of these organizations has invested large 
amounts of time, energy, and money to ensure that the 
various end users are provided the data they need for doing 
their jabs as effectively and efficiently as possible. Any 


of the three generally Enown approaches to tne desian oft 


database systems Mi. ess network, hierarchical, and 
relational) provides for the centralized control Of an 
organization's operational data. However , questions 
concerning the ease of understanding, use, and 


implementation have stimulated research to determine the 
“pest" approach. The earliest database systems were based 
on the network or the hierarchical model. These models lend 
themselves well to the efficient implementation necessary 
#or the maintenance of large databases. Today, with the 
imcreased emphasis on the ease of use and understandina, 
many of the newer commercialized systems are based on the 


relational model. Examples of commercialiy avaliabie 
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systems based on these models include: IMS (hierarmchicalog 
S@QL/DS (relational), and IDMS (network). Fach of these 
systems utilizes a model-based data language which allows 
the user to specify the operations to be performed an the 
date. 

Once aA commitment 1S made to manage a large database 
containing an organization ’s operational data through the 
implementation of one of these systems, 1t 1S financially 
pronibitive to change to another appraach. In addition to 
tne obvious re-programming requirement, user personnel 
(including high-level executive users) must De re-trained in 
the syntax and semantics of a different data language. 
Demurjian, et. al., Nave praeposed an attractive alternative 
to the development of separate, Stand-alone systems = ¢for 
specific modeis. Their research, reported in (CRef. 1 de 
proposes that a system can be designed "...to support 
multiple data models and model-based languages as i1f the 
system 1S a heterogeneous collection of database systems." 

The above proposal 1s based on the existence of a simple 
and powerful data model to which the network, hierarchical, 
and relational models can be mapped. This 1s the attribute- 
based data model as originally described by Hsiao (CRef. 2a 
and extended by Wong (CRef. ee This 1s the data model of 
the Multi-backend Database System (MDES) , a software 
database system designed by Menon and s«Hsiaom [EREf- ae 


The proposal of [CRef. 17 is that the attribute-based system 


VHPBS), with the attrioute-bpased data modei and the 
attribdute-based data language (ABDLI, can serve as a Kernel 
for the support of several data modeis and the data 
languages based on those models. 

The attribute-based system 1s ideaily suited to its 
proposed role as ae kernel otf database systems. As 
demonstrated by Banerjee CRefs. 5, 6, and 73, a relational, 
hierarchical, Or network database can be converted into an 
attribute-based database. The primary database and 
aggregate operations, RETRIEVE, INSERT, DELETE, UPDATE, MIN, 
MAX, SUM, COUNT, and AVG are supported by the system’s high— 


level data language, ABDL. Finally, language interfaces can 


be developed to translate relational, hierarchical, or 
network data language constructs into ABDL constructs. In 
this thesis, we are concerned with the language interface 


aspects of this research. 

In particular, this thesis provides the desiqn = and 
analysis of arelational interface to the attribute-based 
system (MDHS). We extend the work of Macy CRef. 81, who has 
shown that a subset of the relational model-based data 
language, SOL (Structured Query Language) can be directly 
supported by MDBS and AHDL. Macy has provided mappings from 
the SOL SELECT, INSERT, DELETE, and UPDATE constructs to the 
corresponding ABDL constructs: RETRIEVE , INSERT DELETE, 
and UPDATE. The translations are limited to queries 


Involving simple, Single-relation operations. Using these 


— 
ty] 


basic mappings as a foundation, we show that SDL queries 
imvolving set membership operations can also be mapped 
directly to ABDL canstructs. We also demonstrate that otner 
SQL constructs ‘Of particular importance, the nested SQL 
SELECT) can be mapped to a series of ABDL operations. 
Finally. we propose a software structure to facilitate the 
implementation of a complete relational interface far the 
attribute-based kernel (1.e., MDEBS). In the following two 
sections, we discuss our design goals and our unconventional 
epproach to the design of the S@L interface. In the last 
section of this chapter, the organization of the thesis 1s 


presented. 


Ae DESIGN GOALS 

We are motivated to desiqn a SOL interface to MDBS in 
order to demonstrate the feasibility of utilizing the 
attribute-based system as the kernel of database systems in 
general. However , our intention is not to propose changes 
to MDES 2tserr-. Instead, we propose that the S@L interface 
be implemented on the host computer. All translations are 
accomplished in the S@L interface. MDBSG continues to 
receive and process requests written in the syntax of AHDL. 

Related to the goal of avoiding modifications to the 
functionality of MDBS is the goal of keeping the syntax of 
ABDL intact. We utilize existing ABDL constructs in Our 
query translations. A single S@L query may map to one ABDL 


request or a series of ABDL requests. The processing of one 
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request may depend on the results of some other request in 
tne series. Cleariy, the interface must include some method 
of controlling the iterative processing of series of 
reauests. The software structure of Our proposed interface 
(described in Chapter VI and augmented in Chapter VII) 
provides for this iterative control. 


As discussed above, we have made it our goal to leave 


MDBS and ABDL unchanged. We also desire to make our 
interface transparent to the SQ@L user. For example, in a 
corporate environment, a new employee with previous 


experirence with SOL/DS should be able to log in at a system 
terminal, input a SQL request, and receive result data ina 
relational format (1.e., a table). The employee requires no 
training in WMDBS or ABDL procedures prior to utilizing the 
system. An obvious advantage 1s that the new employees 
pecomes ai contributing member of the organization almost 
immediately, with no retraining. The non-productive period 


of new employee indoctrination 1S greatly reduced. 


BR. APPROACH TO DESIGN 

Dur approach to the design and analysis of a SOL 
interface to MDBS 1s unconventional by today’s standards. 
The normal method is to design a system in a top-down 
manner. High-level abstractions are considered first, while 
deferring lower-level details. In this thesis, we consider 
the lowest levels first. We are building upon the basic 


subset of SQL-to-ABDL mappings provided by Macy (CRef. Bais 
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As additional S6L operations are incorporated Lhe 
the interface, we make appropriate additions to the set of 
S@L—-to-ABDL mappings. The functional requirements of = an 
overall software structure for the interface become apparent 
im Chapter V, when we present ABDL translations for the 
nested SHL SELECT. The functionality and organization of 
structure components is described graphically, in text, and 
through the presentation of high-level algerithms. We 
reiterate that, in the development of the SQL interface, 
MOBS 1s considered to bea "black box" which processes 
Catabase requests presented in the syntax of ABDL. We are 
Proposing an interface, residing on a host computer, which 
enables aouser to access a relational database iumplemented 
on an attribute—-based system. Recommendations tom 
modification within the structure of MDBS are made only i1fa 
desirable S@L operation cannot be supported by existing ABDL 
operations. 

Our approach to the presentation of SQL-to-ABDL mappings 
is as follows. We first review the direct mappings (1.e., 
SeEEEOT nein te Vee INSERT/ INSERT, DELETE 7 Dee ies and 
UPDATE/UPDATE) developed by Macy CRef. 84. Heginning in 
Chapter IV, we investigate additional operations to be 
supported by the interface. The functionality of each of 
these operations is thoroughly explained through the use of 
example queries. The equivalent ABDL requests are then 


determined. 
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Sll examples on database operatians presented in this 
thesis are based on the Suppliers-and-Farts datapase 
depicted in Date [CRef. Sat This database contains three 
relations: a, (Suppliers), cam (Shipments), and "“F" 
(Parts). We use many of Date’s examples directly because 
they are well-known, thereby facilitating reader 
understanding of our SOL to ABDL translations. The database 


15 depicted in Figure 1. 


C. ORGANIZATION OF THE THESIS 

In Chapter Ty we present an overview of the 
organization and functionality of the Multi-backend Database 
System THDES).. Also presented are descriptions of the 
attribute-based data lanquage (ABDL) and the relational data 
language CSG). Chapter III reviews the direct SOL-to- 
ABDL mappings as developed by Macy CRef. 81. SOL set 
membership operations involving single relations, and the 
equivalent ABDL requests are explained in Chapter Ey 
Chapter V explains set membership operations on muitiple 
relations (1.e., mested SELECT?. In Chapter VI, a software 
structure 1S proposed to facilitate the implementation of 
nested SELECTs. In Chapter VII, the interface software 
structure is modified to include the functionality necessary 
to accomplish the translation of other single-relation = and 
multiple-relation operations. Chapter VIII presents our 


conciusions and recommendations for future research. 
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SNAME STATUS CITY 


London 
Paris 
Faris 
London 
Athens 










Figure Ac: The Suppliers-and-Parts Database. 
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Lie THE MULTI-BACKEND DATABASE SYSTEM (MDBS)., ITS DATA 
LANGUAGE (ABDL) AND THE INTERFACE LANGUAGE (SOL) 


mn rm 





AS we begin our investigation into the feasibility of 
designing and implementing a complete relational interface 
for the Multi-backend Database System (MDES), it 1s 
important to gain @ general familiarity with the 
organization of MDBS and with the system’s attribute-based 
data language (ABDL). We have selected the Structured Query 
Languege (SQL as the reletional data language to be 
supported by our interface. Therefore, we must aiso nhave an 
understanding of the structure and capabilities of this 
language. 

Im Sections A and B, we briefly describe MDBS and AHDL, 
respectively. Section C provides a brief description of 
SOL. These descriptions, though somewhat superficial, 
shauld enable the reader to comfortably follow subsequent 
discussions. A complete description of MDBS and ABDL can be 
found in Hsiao (CFRefs. 4 and io]. The reader 1s referred to 
Astrahan CRef. 11] and Chamberlin (Ref. 12] for in-depth 


discussions of S@L. 


A. A REVIEW OF THE MULTI-BACKEND DATABASE SYSTEM (MDBS) 
MDES 1S A MmuUltiple-minicomputer backend database 
computer. Off—-the-shelf hardware and specialized software 


are combined to provide database management service to a 


ao. 


host computer. Figure 2 depicts the hardware arqganization 
ot MDES. The hardware organization includes one 
minicomputer as a controller and muitiple minicomputers as 
backends. Fach backend has one or more dedicated disk 
drives. The controller and the backends are connected Dy a 
broadcast bus. The detahase 15 distributed across the disk 
Grives of the RBackend in such a manner that the backends can 
process requests in parallel, providing a significant 
pertormance advantage over traditional single-processor 
architectures. 

The prototype MDRBS, currently operating at the U.S. 
Naval Fostgraduate School, uses a VAX 11/786 as the 
controller and two PDF 11/744s as the backends. Each Vor 
these backends has one or more disk drives for its dedicated 
use. The multiple backends and the controller are connected 
ny DEC’s Parallel Communication Links (PCESIe Their 
broadcast capabilities are simulated in software. 

The major design goal of MDES 1s to provide ai high— 


performance system for large-capacity databases. Throughput 


improvement should be proportional to the number of 
backends, and the response-time reduction should be 
inversely proportional to the number of backends. A second 


design goal is that the system should be easily extensible. 
The system should be able to accomodate additional backends 
with no modification to existing software, and no new 


programming. The incorporation of additional backends 


one or more 


disk drives 
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disk arives 
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Figure 2. The MDBS Hardware Organization 


should not require modification to existing Nardware, and 
disruption of system activity should be minimal. The 
software structure of MDBES provides this extensibility. The 
software of the backends 15 identical, utilizing identical 
operating software for the additional backends. 

Et 1s)6cClear)6=6that—)2—=6the)=6ccontroller could become a 
bottleneck. MDBRS reduces this potential by minimizing the 


role af the controller and maximizing the amount of work 


done by the backends. The software structure of MDBS is 
shown 1n Figure 3. The functions of the controller are 
limited ta request preparation, insert information 


generation, and post processing. The request preparation 
functions are performed before a request 1s placed on the 
broadcast bus. These functions handle parsing, syntax 
checking, and the transformation of a parsed request into 
the form required for processing at the backends. The 
Insert information generation functions are performed during 
the processing of an insert request. These functions 
Provide additional information to the backends, such as the 
identity of the particular backend at which the record is to 
be inserted. The post processing functions are performed 
after replies are returned from the backends. For example, 
result data are collected prior to forwarding to the host 
computer. 

As described above, the controller does reiatively 


little work. The backends, on the other hand, are 
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Eyemene 3. The MDES Software Structure 


responsible for all the major database management functions. 
These are directory management, record processing, and 
concurrency control. The directory management funmctions 
determine the secondary storage addresses of the anpropriate 
records and perform directory table maintenance. The record 
processing functions store records into secondary ' storage, 


retrieve records from secondary storage, and select the 


records that contain the desired information. The 
concurrency control functions ensure consistency + om 


concurrent execution of user requests. 

The key to high-performance 1s in the paralleiism of the 
backends. The database is distributed across the disks of 
ali of the backends. Therefore, when a request 15 
broadcasted from the controller, each backend can execute 
the request on its portion of the database. To yield = an 
additional performance advantage, a queue of requests 15 
Maintained at each backend. Each backend schedules requests 
for execution independent of the activities of the Gthem 


backends. 


Es THE ATTRIBUTE-BASED DATA LANGUAGE (CABDL) 

We preface Our discussion of the syntax and 
fumctionality of ABDL with a brief introduction to the data 
model supported by MDES. This model is the attribute-based 
data model, originally developed by Hsiao (CRef. 2]. The 
following constructs are informally defined. A database 


consists of a collection of files. Each file contains a 


unique group of records. Each record 1s composed of two 
parts. The first of these parts is a coliection of 
attribute-value pairs or keywords. An attribute-valiue pair 
is an element of the Cartesian product of the attribute name 
and the domain of attribute values. As an example. 
ZSTATUS, 207 1s an attribute-value pair having 30 as the 
value for the STATUS attribute. In each record, there is at 
most one attribute-value pair for each distinct attribute 
defined in the database. The last part of each record 
contains textual information. This 1s the record body. An 
example of arecord without a record body 1s shown below. 
We note that all examples in this and subsequent sections 
are based on Date’s’ suppliers-and-parts database as 


Gescribed in CRef 9] and in Chapter I. 
See Sie, S>,<SH#,Si>,<SNAME,Smith?>,<STATUS, 20>,<CITY,London> ) 


The first attribute-value pair in every record indicates the 
file name. In the example above, the file name is ‘°S’ (the 
Suppliers file). 

The database can be accessed through the use of keyword 
predicates. Each of these keyword predicates 15 a three- 
tuple of the form (attribute, relational operator, value), 
2.0. , folrATuUSecaesO). When keyword predicates are combined 


nto a conjunction such as 


(EEE = Sy 7 (STATUS © =0)0 


or into a disjunction of CONIUNCTIONS SVE oS 


Cee 5) AS ASNaite Ssmith)) V 


te tt S) A (SNAME Jones) )) 


a query (in disjunctive normal form) of the database 15 
formed. 

In the following subsections, we will see how these 
keyword predicates and queries are used in the attribute- 
based data language for search and retrieval operations. We 
describe the syntax and functionality of the four types of 
request supported by ABDL: retrieve, insert, delete, and 
update. Appendix A provides a formal specification of this 
ron-procedural language. 

ite The RETRIEVE Request 

The RETRIEVE request allows the user to query the 
database for information. This operation obtains the 
requested data without altering the database. The syntax 


1s: 


RETRIEVE (Query) <Target-list?> CBY attributed CWITH Pointer] 


The type of the request 1s indicated by the reserved word 
RETRIEVE. As we have seen, the Query part is composed ot 
predicates in the disjunctive normal form. From our 
previous discussion, we note that the Query specifies the 
file and those records within the file which satisfy the 


request. The attributes for which values are to be 


extracted from this portion of the database are contained in 
the Target-list. ABDL supports five aggregate operations: 
AYG, COUNT, MAX, MIN, and SUM. Therefore, the attribute 
Value may be an aggregate of values fram multiple records, 
or the value from a single record. 

The BY and WITH clauses are optional, as indicated 
by the square brackets in the syntax. The BY-clause is used 
when a grouping by some attribute is desired. The WITH- 
Clause specifies whether pointers to the retrieved records 
must be returned to the user for later use in an update 
request. As an example of a RETRIEVE request, if we wish to 
obtain supplier names for all of the suppliers with STATUS 
greater than 10, grouped by location, we may use the 


FOLiiowing query: 
Benn lEVE «t(FILE = S? A (SFAFUS > 1O0)) <SNAME> BY CITY 


Tage The INSERT Request 
The INSERT request alters the database by adding a 


new record. fhe syntax is: 
INSERT Record 
An example of an INSERT request is: 
Poche «Fike .S-, «~S#. Si >, <SNAME,Smith> > 


This adds a record to the suppliers file for supplier number 


Si and identifies that supplier as Smith. 


=. The DELETE Reguest 
The DELETE request aiters the database by removing 
an existing record or records. The syntax 15: 
DELETE Query 
where Query specifies which records are to be deleted. An 


example of a DELETE request is: 
DELETE ((FILE = S) A  (Siares.— 10) 


This deletes all records in the suppliers file for suppiiers 
whose status 1S equal to 10. 
4. The UPDATE Request 
The UPDATE request alters the database by modifying 
the value of some attribute in an existing record. The 


syntax 15S: 
UPDATE Query Modifier 


where Modifier indicates which of five types of modification 
1s to be performed. These modifiers are defined as follows. 
A type-O% modifier sets the new value of the attribute being 
modified to a constant. A type-I modifier sets the new 
value of the attribute to be some function of its old value 
in the record being modified. A type-II modifier sets the 
new value to be some function of another attribute value 1n 
the record being modified. A type-III modifier sets the new 
value to be some function of another attribute value 1n 


another record identified by the Query in the modifier. A 


type IV modifier sets the new value to be some function of 
another attribute value in another record identified by the 
pointer ain the modifier. An example of an UPDATE request 
{using a type-I modifier) is: 

UPDATE (FILE = S) «STATUS = STATUS + 102 
which adds 190 to the status of all suppliers. 
C. THE RELATIONAL QUERY LANGUAGE (S@L) AS THE INTERFACE 

LANGUAGE 

AS aindicated in Chapter iI, we have selected the 
Structured @uery Language (S@L) as the data language to be 
supported by our relational interface to the Multi-backend 
Database System (MDHS). The languages commercial 
avallability coupled with 1ts simple yet power ful 
functionality make S@L an ideal choice. 

In the preceding section, we described the attribute- 
based data model prior to introducing ABDL. However, in 
this section, we assume a certain familiarity with the 
relational data model as we Riera to describe the four 
basic constructs of SOL: SELECT. INSERT, DELETE, =/and 
UPDATE. If the reader desires a review of relational 
theory, there are several very good texts available. In 
particular, we recommend Date (Ref. 9] and Uliman CRef. 15]. 
A discussion of the mapping between the relational data 
model and the attribute-based data model can be found in 


Raner jee CRef. 4]. 


1. The SBELECR buen» 

Data retrieval, which 1s represented syntactically 
as a SELECT-FROM-WHERE block, is the most basic operation of 
S@L. Mapping indicates that a known quantity (STATUS = 30) 
is to be transformed into a desired quantity (‘(SNAME) by 
means of a relation (SS). The attributes to be returned are 
listed ian the SELECT clause (the built-in functions COUNT, 
SUM, AVG, MAX, and MIN may be applied to these attributes). 
The FROM clause indicates which relation or relations are to 
be searched. The WHERE clause specifies the retrieval 
conditions. As an example, if we desire to obtain the names 
of suppliers whose status 15 30, we may use the following 


query: 


SEE Ti SNAME 
FROM = 


WHERE STATUS = 30 


The SEECEET construct allows’ the user great 
flexibility in data retrieval operations. The user can list 
several relations in the FROM clause in order to obtain 
values selected from more than one relation (JOIN 
operations). The WHERE clause can contain any number of 
predicates including the six standard relational operators 
(=, “=, ey F=, &, and <=), and the Boolean operators (AND, 


OR, and NOT). Parenthesis may be used to indicate a desired 


order of evaluation. The set comparison operators IN, ANY, 


oe 


and ALL may aiso be used in the WHERE clause. (We 
investigate the use of these operators in Chapter IV.) 

There are many other possible variations to the 
SELECT operation including the extremely useful nested 
mre tLT. In the nested SELECT, the result of one SELECT 
request 1S used in the WHERE clause of another SELECT 
request. (The nested SELECT is thoroughly described in 
chapter V.) 

ms The INSERT Query 

The INSERT request allows the user to insert a new 
tuple (row) or set of tuples into an existing relation 
(table). Insertion of a single tuple can be accomplished 


through the use of a query such as 


INSERT INTO S: 


2'S4',’Rollins’, ‘40°, ‘Newport’ > 


In this example, all of the attributes are present and in 
feme correct order. If some attribute values are unknown, 
those attributes for which values are being inserted must be 
listed following the relation name. A SQL INSERT statement 
may also evaluate a SELECT request and insert the resulting 
set of tuples into an existing (or temporary) relation. AN 


example of such an INSERT operation is as follows. 


te 
put 


INSERT PING. veitee 
SERGE PH 
FROM om 


WHERE S# = ‘'S2’ 


This enters into TEMP part numbers for all parts supplied by 
supplier S2. 
as The DELETE Query 

The DELETE specifies tuples to be removed from the 
database. The tuples are indicated by means of a WHERE 
Clause that 15 syntactically identical to the WHERE clause 
of a SELECT. Comserruct. As an example, to delete supplier 
number five from the supplier relation, we may use the 


following query. 


DELETE = 


WHERE S# = °S5° 
We may also delete all shipments with the query 
DELETE SP. 


The SP relation 1s still known, but 1t 1S now empty. 
4. The UPDATE Query 
The UPDATE request is syntactically similar to the 
DELETE request, except that a SET clause is used to specify 
the updates to be made to the selected tuples. New 
attribute values contained in the SET clause may be stated 


as constants, as expressions based on the original value of 


ty 
bh 


Emee attribute, or as 
UPDATE request 156 
UF DATE 
Set 
WHERE 


This doubles the status 


nested queries. An exampie 


2 
STATUS = 2 * STATUS 


CITY = ‘London’ 


of all suppliers in London. 


4 
CA 


of 


am 


III. REVIEW OF BASIC MAPPINGS 


As we have described in Chapter [I, the four primary 
database operations of the Structured Query Language (S@L) 
are SELEG |. INSERT, DELETE, and UPDATE. Macy (Ref. 81] has 
shown that for a subset of simple, single-relation S@L 
queries of all four types, there exist direct mappings into 
requests of the Attribute-based Data Language (AHDL). These 
mappings are fundamental to all further S@L—-to-ABDL 
translations introduced in this thesis. Therefore, in the 
remainder of this chapter, we provide a review of these 
hasic mappings as defined by Macy. We explain the mappings 
both graphically and in text. Each graphical presentation 
will display the general forms of the S@L- and ABRDL 
constructs, and the mappings between them (such as Figure 4, 
which depicts the SELECT to RETRIEVE mapping Sample 
translations, utilizing our suppliers-and-parts database, 
will be presented in the text. The subset of SQ@QL, for which 
translations are described, contains those operations that 
Macy has determined can be directly supported by MDEBS'= and 
ABDL. In the next chapter, we will show that SELECT 
requests involving set comparison operators can also be 
directly supported. In subsequent chapters, we describe 
translations for SQL constructs such as the nested SECEET 


which involve multiple ABDL constructs. 


Prior toa describing the specific S@L to ABDL mappings 
(e.g., SELECT to RETRIEVE), we discuss two general types of 
mapping identified by Macy: Syntactic—substitution mapping 
and Conversion mapping. Syntactic—substitution mappings are 
accomplished By simple substitution of syntactical terms. 
Mappings requiring only substitution are denoted by 32 
directional arrow tlabeied with a square containing the 
letter §S& (e.qg., the mapping between the reserved words 
SELECT and RETRIEVE in Figure 4). Conversion mappings are 
accomplished By cambining a clause from an S@L query with 
information about the ARBDL data structure to create the 
equivalent clause of the ABDL construct. Mappings requiring 
conversion are denoted by a directional arrow labeled with a 
triangle containing the letter C (e.g., the mapping between 
the S@L FROM and WHERE clauses to the ABDL Query in Figure 
4). We will describe conversion mappings in more detail as 
we present each for the SOL to ABDL translations. For an 
extensive discussion of the basic mappings described in this 
Chapter, the reader is referred to Macy [Ref. 8]. 

A. MAPPING THE S@L SELECT QUERY TO THE 

ABDL RETRIEVE REQUEST 

The mapping from the S@L SELECT to the ABDL RETRIEVE is 
depicted in Figure 4. The mapping proceeds as foliows. 
The reserved word SELECT 1S mapped by syntactic substitution 
to the reserved word RETRIEVE. The sel _expr_ilist maps 


Girectiy to the target_list. A conversion mapping 15 


(4 
Of 


required to translate the FROM and WHERE clauses to the ARDL 


query clause. Tnis 1s accomplished by creating an equality 

keyword-predicate for the relation_name, e.g., FILeaae 

reiatian_name. This mew predicate is combined with the 
SEEBET 


sel _ expr list 








FROM Relation name 
WHERE boolean] 

C[GROUF BY field _namej 
RETRIEVE 


query 


7 
CHY attribute] 


target _ list 


Figure 4. Mapping the S@L SELECT to the ABDL RETRIEVE 


other predicates listed in the boolean expression to form an 
equivalent AHDL query clause. This conversion is called a 
query-conversion mapping. The GROUP BY construct maps 
directly to the BY construct. As an example of a SELECT to 
RETRIEVE translation, the following S@L SELECT will, for 
each part supplied, get the part number and the _ total 


quantity supplied of that part. 


Ud 
O- 


SELECw P# , SUM (COTY) 
FROM oF 


GROUP BY F# 


An equivalent ABDL request is 


RETRIEVE (FILE = SP) <P#H,SUM(QTY) > BY FR 


B. MAPFPING THE SQL INSERT QUERY TO THE ABDL INSERT REQUEST 
The mapping from the S@L INSERT to the ABDL INSERT 15 

depicted in Figure 5. The mapping proceeds as follows. The 

reserved word INSERT 1s the same for both requests. 4 


conversion mapping, referred to as a record-conversion 


mapping, in this case, 1S required to translate "INTO 
relation_name insert_spec” into the ABDL "record". As we 


have seen in Chapter II, the ABDL record 1s a series of 
attribute-value pairs, the first pair of which identifies 


the file name. This mapping, then, can be accomplished by 


PISERT 
a INTO relation name 
b insert _spec 
moe T 
record 


Figure 93. Mapping the S@L INSERT to the ABDL INSERT 


constructing attribute-value pairs for the relation/file and 


relation/file name and for the values of the attributes 


listed in the insert _spec. As an example of an SQL INSERT 
to ABDL INSERT translation, the following S@L INSERT query 


will add part F7 (name ‘Washer’, color ‘Grey’, weight ‘2’, 


city ‘Athens’‘) to relation/file P. 


INSERT INTO FF: 


“°P7* , Washer’, ‘Grey’','2', Athens‘’> 


An equivalent ABDL request 15 


INSERT (¢FILE,P>,¢P#,P7>,<PNAME,Washer >, 


<COLOR,Grey?,<WEIGHT,2>,<CITY,Athens>) 


C. MAPFING THE S@L DELETE QUERY TO THE ABDL DELETE RE@QUEST 


The mapping from the SG@L DELETE to the ABDL DELETE is 


depicted in Figure 6&6 The mapping proceeds as follows. The 
reserved word DELETE is the same for both requests. 
The query-conversion mapping, as described in Section A, 
DEMETIE 
relation_name 
CWHERE booleand 
Vv 
DEBETLE 


query 


Figure 4. Mapping the SQL DELETE to the ABDL DELETE 


1s used to translate "relation_name" and "WHERE boolean" 


into the ABDL query clause. As an example of an S@L DELETE 


to ABDiL DELETE translation, the following S@QL DELETE query 


will delete supplier Si from the suppliers relation. 


DELETE. S 


WHERE S# =‘'Si’ 


An equivalent ABDL request is 


DELETE VteIite = Ss) /\ (St 


al) 


D. MAPFING THE S@L UPDATE QUERY TQ THE ABDL UPDATE REQUEST 
The mapping from the SOL UFDATE to the ABDL UFDATE 1s 


depicted in Figure 7. The mapping proceeds as’ follows. 


UPDATE 
relation_name 
set clause list 
CWHERE boolean) 


UPDATE 





modifier 


Figure 7. Mapping the SQL UPDATE to the ABDL UPDATE 


The reserved word UPDATE is the same in both requests. As 
1n Sections A and C, the query-conversion mapping is used to 
translate "relation_name" and "WHERE boolean" into the ABDL 
query clause. This conversion 1S common to the 


Seeel tl /RETRIEVE, DELETE, and UPDATE translations. The 


I 
0 


component "set ciause list" directly correlates to the ABDL 


"modifier", 2.2@., both constructs specify how the records 


being modified are to be updated. To accomplish this 
translation, the modifier conversion mappin is used. 
The conversion required iS a restructuring of SOL 


set _ ciause_115t constructs into acceptable ABDL format. The 
modifier—-conversion is similar to the query-conversion. We 
now present an example of the conversions that are required 
im the translation of an S@L UPDATE to an ABDL UPDATE. it 
we desire to double the status of all sSupplierS in london, 


we may use the following S@L query: 


UPDATE §& 
SET STATUS = 2 * STATUS 
WHERE CITY = ‘London’ 


An equivalent ABDL request 15 


UPDATE ((FILE = S) A (CITY = London)) (STATUS = 2 * STATUS) 


AO 


Pye SelestliONS With SET HEMBERSHIF OPERATIONS GON 


rr re 


SINGLE RELATIONS 
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AS we nave seen, the condition following the WHERE 
clause in SQL SELECT operations may include the normai 
comparison operators, iene © 5 wae, STC: Macy CRef. 8] nas 
shown that MDBS supports simole, singlie-relation retrieval 
operatians using these comparison operators. SQL aliows the 
use of several additional comparison aperators. Three of 
these, IN, ANY, and ALL, deal with the set membership, and 
Meewor particttiar interest to us as we investigate possibie 
extensions to the subset of SUL operations whose intertaces 
were proposed by Macy. 

In this chapter we show how qualifications using DMs 
ANY. and ALL can be supported by HMDES. We first cansicer 
the simple case where set members are enumerated ain the 
query. Some of tne exampies we provide herein may not 
appear very useful. However, they will serve to iilustrate 
the mechanics of SELECT operations using thes® compariscen 
operatcre. Their usefulness will become apparent in Chapter 
Y, when we use them in retrievals invoiving muitiple leveis 
Ot nesting. 

In sections A, B, and C, we formally define the 
comparison operators IN, ANY, and ALL, respectiveiv. AS 
noted by Chamberlin, et. al. CRef. 14], English langtuage 


definitions of these operators are, at best, ambiguous. de 
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hall, nevertheless, attempt to explain them in text orior 


fl 


to providing a Clarifying definition in predicate eee AN 
exampls af a SELECT query will then be given for each case. 
The result relation of each of these examples wiil be 
pravided in order to further clarify the uses of these 
operators. FS in previous chapters, eur examples specify 


retrievals of data contained in LPate’s database (defined in 


Chapter fe We will continue to utilize this database 
througheut this thesis. Aqain, note that seme of our 
examples are tekken directly from Date (CRet. 9]. In Sections 


Dee. and F we express IN, ANY, and ALL in the ABE 


requests. 


A. IN-MEMBERSHIP SPERAt Mains 


The comparison operator, IN, can be thought of as the 


set membership operator. C . Correspondingly. NOT IN is 
equivalent to an 
i The Set Membership Operator, JIN’ 
The operator, IN, 1S evaluated as follows. The 


condi trem, A IN BB, evaluates to be true if and oniy if the 
value of attribute Ais equal to at least one value in the 
enumerated set EB. The formal definition in predicate logic 


tollows: 


We (x € A ¢==> dy (y E B | x = y)) 


if we wish to ebtain supplier numbers tor 


fut 
us 


EXAMPLE 
suppliers Smith and Jones, Wwe may use the 


following query: 


Sere T S#,SNAME 
FROM 5 


WHERE SNAMNE IN (Smith,Jones) 


The result relation is: 





et The Set Membership Operator, ‘NOT IN’ 

Tne operator, NOT_IN, 15 evaluated as follows. The 
condition, A NOT_IN BH, evaluates to be true if and only if 
the value of attribute Ais nat equal te any value in the 
enumerated set &#. The formal definiticn in predicate logic 


foliowes: 


Vx (x € a c==>Vy wes | x ~= yy) 


EXAMPLE 2: If we wish to obtain supplier numbers for 
suppliers who supply some parts, but da not 
supply parts PS or P4, we may use the following 


query: 


See | 2 
FROM oF 


WHERE free! IN <Ps5P4) 


The result relation 1s: 





Ete ANY-MEMBERSHIP OGFERATIONS 

The comparison operator, ANY , 1s used in conjunctian 
with the six standard relational operators, =, “=, t=, *=, 
\~¢ Wana =. Tt specifies variations on the theme of set 


membership as explained in the following subsections. 


ie The Set Membership Operator, '=ANY~ 
The operator, =ANY, 1S interchangeable with the 


operator, IN. The condition, A =ANY B, evaluates to be true 
if and only if the value of attribute A 1s equal to at least 
one value ain the enumerated set BB. Example i and the 
OoOredicate logic definition given for the operator IN apply 
equally to =ANY. In subsequent examples involving set 
membership, we shall use IN rather than =ANY. 

>. The Set Membership Operator, |'*=ANY! 

The operator, “=ANY, 1S evaluated as follows. The 
condition, “=ANY B, evaluates to be true 1f and only i1f 
the value of attribute A 1s not equal to at least one value 
in the enumerated set &. The tormal definition in predicate 


iogic follows: 
: Wie (x E A <==> dy (y € 8 | x “= y)) 


EAMMPILE, 55 Tf we wish to obtain supplier numbers for 


suppliers who supply some parts, but do not 
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rt 


ID 


supply both parts Fl and P2, we may use h 


following query: 


SEEECT S# 
FROM SP 


WHERE PH “=ANY (FP1,P2) 


The result relation is: 
54 
=. The Set Membership Qperator, ‘<=ANY~ 

The operator, <=ANY, 15 evaluated as follows. The 
condition, A *«<=ANY B, evaluates to be true if and only 1ft 
the value of attribute Ais less than or equal to at least 
one value in the enumerated set &. This implies that the 
value of attribute Ais less than or equal to the maximum 
value in the set B. f=ANY, then, 15 not particularly useful 
in the case of enumerated sets. The operators +=, =, and « 
are similarily of limited value when sets are enumerated in 
the query. As previously stated, the usefulness of these 
operators will become apparent when we discuss queries in 
which the resuits of one SELECT operation determine the set 
members in the WHERE clause of another SELECT operation 
(nested SELECT). The formal predicate logic definition of A 


<=ANY B follows: 


We (x E A <==) dy (yc. EF x <= y)) ==> 


oe (x € A <==> x <= max TH}) 
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As can be seen fram the predicate logic definition, when 
using the operator, <=ANY , 2t 1S logically unnecessary to 
ii1st more than one value (the maximum value) in the 
enumerated set #. A similar comment 1s applicable when 
USING Ss=ANY, <ANY, or ANY. However, im anticipation of our 
nested SELECT discussion in Chapter V, exampie queries 
utilizing these aperators will each contain an enumerated 
set having more than one member. The additional Values 
listed in tne set are superfluous. However, they will nelp 
Gemonstrate the differing results obtained through the use 


of the ANY and ALL operators. 


EXAMPLE 4: If we wish to obtain supplier names for 
suppliers whose status 1S not larger than 30, 


we may use the following query: 


SELECT SNAME 
FROM 5 


WHERE STATUS <=ANY (€10,20,30) 


= 


me resciuit relation 1s: 





4. The Set Membership Operator, | S=ANY! 
The operator *=ANY 1S evaluated as follows. The 


condition A ?=ANY B evaluates to true if and only i1f the 
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value of attribute A is greater than or equal to at least 
one value in the enumerated set &. This implies that tne 
value of attribute A is greater than or equal to the minimum 
yYalue in the set &. The formal definition in predicate 


logic follows: 


EXAMPLE 3: If we wish to get supplier names for suppliers 
whose status 1s not less than 10, we may uss 


the following query: 


See Ee , SNAME 
FROM 3 


WHERE STATUS 2=ANY (10,20, 30) 


The result relation 1s: 





3. the Set Membership Operator, “<ANY~ 

The operator, <ANY, 1s evaluated as tollows. The 
condition, A “ANY HB, evaluates to be true if and only if the 
Value of attribute Ais less at least one vaiue in the 
enumerated set BE. This implies that the value of attribute 
A 1s less than the maximum value in set 4&6. The formal 


Predicate logic definition follows: 
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EXAMELE=*s: If we wish to obtain supplier names’ for 
suppliers whose status is less than 30, we may 


use the following query: 


SEEET SNAME 
FROM S 


WHERE STATUS <ANY (€10,20,20) 


The resuit relation 1s: 





6&. the Set Membership Operator, ‘'?ANY’ 

The operator, 2ANY, is evaluated as follows. The 
condition, A -ANY B, evaluates to be true if and only if the 
value of attribute A 1s greater than at least one value in 
the enumerated set &. The formal predicate logic definition 


foilows: 


WE (x € A <==> dy (y € #B x > y)) ==> 


Ve (x @ A <==> x > min (BD) 


EPAGMPLE 7: If we wish to obtain supplier names for 
suppliers whose status 1S greater than 10, we 


may use the following query: 
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SSL eEee SNAME 
analy = 


WHERE STATUS FANNY (19,20,30) 


The result reiatian 1s: 





fee PLL-MEMBERSHIP GPERATIONS 

Like the comparison operator, ANY, the operator, ALL, 15 
used in conjunction with the six standard relatianai 
ceerators. It also Specifies variations on the set 


membership theme. 


ee The Set Membership operator, [=ALL’ 
The operator, =ALL, 1S evaluated as follows. The 


condition, A =ALL H, evaluates to be true if and oniy i1¢ the 
Value of attribute Ais equal to every (each) value in the 
enumerated set 8B. The formal predicate logic definitian 


#ollows: 


ee ix € A <==> dy iy eo | x* = yd) “A 
Vy (y & B <==> 4x (x E A | ae y)) 


From this definition, it 1S apparent that the set B, whether 
manually enumerated cr determined by the results of an inner 
SELECT, would contain only one value (or duplicates of that 


value). Therefore, since we can always use a condition of 
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the form WHERE STATUS = 30, we shaili net use the operator 


SALL ion further discussion or examples. 


tom 


2s Tne Set Membership Qperator, '*=ALL" 

The operator, “=ALL, 1S interchangeadle with the 
operator, NOT_IN. The condition, A Y“=ALL B, evaluates to be 
true if and only if the value of attribute A 1s not eaquai to 
every value in the enumerated set H. In other words, there 
IS nO Value in the set H to which the value of attribute A 
is equal. The predicate logic definition of NOT_IN 15s 


repeated for clarity: 
Vx Cera caa> Vy (vac | x “= y)) 


The query given in example 2 (with Y=ALL substituted tor NOT 
Ih) 1S applicable. In subsequent examples involving set 


membership, we shall use NOT IN rather than Y=ALL. 


Se The Set Membership operator, ‘z=ALL’ 
The operator, “=ALL, 165 evaluated as follows. The 
GCOngd1 tien, Ae SA ft, evaluates to be true at and only if 


the value of attribute A 1s less than or equal to every 
value in the enumerated set &. This implies that the value 
of attribute A is <= the minimum value in set &#F. The 


Predicates logic definition follows: 


Again, as in the case of the operator ANY, our degenerate 


SQ 


Semples Werlicamg Ene operators «<=ALL, 2=ALL, “ALL, and 
SALL will be presented with enumerated sets containing mare 
tnan one member teven though, Logically, only one member is 


necessary). 


EXAMFLE 98: Tf we wish to obtain supplier names for 
suppliers whose status is not greater than 14, 


we may use the foilowing query: 


Sele | SNAME 


FROM Ss 
WHERE SraruSe-=ALt (10,20, 50) 
Tne result relation i185: SNAME 


Jones 





Mote that the difference between the comparison operators 
ANY and ALL 1s readily apparent when we compare this example 
with example 4. In example 4, the cperator, <=ANY, allows us 
to obtain supplier names for suppliers whose status 15S not 
larger than 306. The result relation in that example 
inciudes the names of all five suppliers. 
4. The Set Membership QGperator, /s=ALL’ 

The operator, 3=ALL, 15 evaluated as follaws. The 
condition, A -s=ALL BB, evaluates to be true if and only if 
tne value of attribute Ais greater tnan or equal to every 


Yalue in the enumerated set #. This implies that the vaiue 


of attribute A 1s greater than or equal to the maximum vaiue 


PARTIPICE  ~Sts if we wish to obtain supplier names tar 
suppliers whose status is at least 36 we may 


ie 9 


use the following query: 


SSR EE SNAME 
FROM = 


WHERE STATUS S=ALL (10,20,350) 


Pa 
RHqgams 


= The Set Membership operator, ‘zALL’ 





The resuit reiation is: 





The operator, <ALL , 1S evaluated as foilows. The 
condition, A zALL B, evaluates to be true if and only if the 
value of attribute A is less than every value in tne 


enumerated set #. The predicate logic definition foiliows: 


Vs (x E A <e=> Vy (y EB | x <€< yd) ==> 


Ve © A os eee 


Eerie ro: if we wish to abtain supplier names =“*-for 
suppliers whose status is less than 10, we 


may use the following query: 


SEL Eta SNAME 
FROM = 


WHERE STATUS *ALL ¢€10,20,30) 


The result reitation is: SNARE 


Note that this is the empty relation. There are no 
suppliers whose status is less than 10. 
oe The Set Membership Gperator, §sALL' 
Tne operator, PALL, 1S evaluated asc follows. The 
condition, A -ALL B, evaluates to be true if and anly if the 
Valve of attribute Ais Greater than every value in the 


enumerated set &#H. TRE predicate logic definition foilaws: 


ae ix EA ca=> Vy (y € B | a 


EXAMPLE itl: If we wish to obtain supplier mames tor 
suppliers whose status 1S greater than 30, we 


may use the following query: 


Smee | SNAME 


FROP S 
WHERE STATUS AIM (10,20,30) 
The result relatian 1s: SNAMRME 





AS in example ti this is the empty relation. There are no 


suppliers whose status is greater than 30. 


D. EXPRESSING IN-HEMBERSHIP OFS ai oNS eee 

In this and the foilorwing two sections, we present ABDL 
tranmsiations for the examples given in sections 4, HB, and C. 
Each S@tL example will be repeated, fOlioqwed by the ABDL 
translation. 

Ws The Set Membership Operator, JIN’ 


The SOUL query presented as example i is 


Sie St eth ie 
ae = 


WHERE ONAME IN (Smith,Jones) 


fur croposeda SQL interface would pravide the follawing 4BDL 


translation: 


REVERB LEVe 2 ee Tiere 


S) A {SNAME = Smithd) WV 


I 


(CE Tie SS? (CSNeNE Jones:}) «<S#,SNAME> 


Gne& conjunction is created for each vaiue in the enumerated 
Set, cantaining an equaiity predicate. Tne ABDL reguest 


wild have as many conjunctions as there are values in the 


a The Se 


rt 


Membership operator, “NOT IN’ 





The S@L query presented as example 2 15 


See 2# 
rm Site 


WHERE P# NOT IN (P3,P4) 
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- 
7 
i 


ABDL translation is 


pepe vee Eee SP) AN (RH = PSY “A (P# “= F4) < SH? 


One predicate of the form (attribute *= value) is created 
for each value in the enumerated set. The ABDL request will 


ogicai AND of 


jms 


contain a single conjunction, which 1s the 


these predicates. 


E. EXPRESSING ANY-MEMBERSHIF OPERATIONS IN ABDL 


Ue The Set Membershion Operator, ‘'=ANY" 
As previously defined, =ANY 16 equivalent te IN ana 


Willi not be included in our set of allowable SDL corstructs. 


Ei Tne Set Membership Qperator, |™=ANY! 


Tne S@L query presented as example 3 is 


FROM SE 


WHERE FH ~=ANY (P1i,P2) 


The AHBDL translation 15s 


Belin Jee. ( (FILE Si NN tH = Er) > Vv 


(SIRE e=—S5P) A SMPs == F2))) <S#> 


Il 


Dne conjunction 1S created for each value in the enumerated 
set, containing a predicate of the form (attribute “= 
Value). 


he The Set Membership Qperator, ‘“<=ANY 


The S@L query presented as example 4 is 


an 
cn 


eee 3 


WHERE STATUS <=ANY (imei 2i ee” 


The ABDL transiation is 


RETRIEVE (tFILE =S) “A WstiAlus 4] 22) s ens, 


OQne predicate of the torm (attribute “== max_value) is 
created. The ABDL reguest will contain a single 
conjunction. Note that the S@L interface recognizes that 


the condition in the WHERE clause evaluates to true if and 


ct 


oniv if aA supplier’s status is less than or equal to Wa 


rt 


least one of the status vaiues in the enumerated se 
Cimplying that that supplier’s status is less than or equal 
to the maximum value in the set). Therefore, oaniy the 
maximum value, 20, 1s utilized in the ABDL translation. 

4. The Set Membership Operator, 's=ANY! 


| at 


The S@L query presented as example 3 15 


SELECT SNAME 
FROM S 


WHERE STATUS -=ANY (10,20,50) 


The ABDL transiation is 


RETRIEVE ((FILE = S) A (STATUS 7= 10)) —Sh7iie 


One predicate of the form (attribute s= min_value) is 


created. The ABDL request will contain a single 


meh) JMG = LOM. As in the ‘“<=ANY° case, only one vaiue of the 
enumerated set in the WHERE ciause is utilized in thre ABDL 
translation. In this case, the minimum value, Li is 
utiiized. 

as The Set Membership Operator, ~“<ANY 


The S@L query presented as example 6 15 


ee aa SNAME 
FROM 2 


WHERE Siete ewminy 820), 205.50) 


The ABDL translation is 


Nene Eee eE = 5S) AetSTATUS «< 30)) <SNAME> 


One predicate of the form (attribute << max_value? is 
created. The ABDL request will contain 2 single 
conjunction. 

S. The Set Membership Operator, ':ANY! 


The S@L query presented as example 7 is 


SRE SNAME 
FROM = 


WHERE STATUS =ANY (10,20,30) 


The ABDL translation is 


RE Tinie eet TLE 


S) A (STATUS > 10)) <SNAME> 


ne predicate of the form (attribute - min_value) 1s 
created. The ABDL request will contain a singie 


conjunction. 


F. EXPRESSING ALL—-MEMNBERSHIP GFPERATISNS eS iieeee 
doc jhe Set Membership Operator, '=ALL’ 

AS previously defined, use of the operator, =ALL, 15S 
equivalent ta using the standard equality operator, =. We 
wili, therefore, not include it in our set of allowable SG@GL 
constructs. 

cae The Set Membership Operator, 'Y=ALL' 

AS previousiy defined, ~=ALL 1S eguivalent to 
NOT IN and will not be inciuded in our set of allowable S@L 
renstriucts. 

hs The Set Membership Operator, “2=ALL 


The S@L query presented at example 8 15 


SERBEE SNAME 


FROM 5 


WHERE QTATUS =<=ALL (10,20,30) 


The ABDL translation is 


RETRIEVE ((FILE = S) A (STATUS <= £0)) 2st 


One predicate of the form (attribute “<= min_value) is 
created. The ABDL request will contain a single 
conjunction. AS 1m the ‘’<=ANY’ case, the translator in our 


SOL ainterface utilizes only one value from the enumerated 
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set. Note that in this case, the minimum value, ne 1s 
chosen, whereas, in the ‘<=ANY’ case the maximum vaiue, 30, 
Ps chosen. 
4. The Set Membership Operator, '2=ALL' 
The S@L query presented as example 9 15 
Sie Gh SNAME 
FROM = 


WHERE STATUS 2=ALL (10,20,30) 


The SEBDL transiation 1¢ 


BEVTRIEVESC((FILE = S) “A (STATUS += 30)) <SNARE> 
One predicate of the form (attribute f= max_value} 1s 
created. The ABDL request will contain a Ssingie 
fon Junction. As in the ‘s=ANY' case, oniy one vaiue of the 
enumerated set 15 utilized. In this case, the maximum 


Value, 20, 16s utilized in the equivaient RETRIEVE construct. 
We recall that the minimum value, 1 was utilized in tne 
“S=ANY" case. 

ys The Set Membership Operator, ‘<ALL’ 


The SQL query presented as exampie 10 is 


Sole | SNAME 
FROM = 


WHERE See omaeiia CLO, 20, 30) 


The ABDL translation is 


RETRIEVE “Geaibke — a0 (STATUS < 10) 252 5newe 


One predicate oft the form Cattribute << min _vainue) 1s 
created. The ABDL request wiil contain a singie 
GEOR JUREE? om. 


oe The Set Membership Operetor, '>ALL’ 


The S@#L query presented as example 1115s 


SELEY SNAME 
FROG = 


WHERE STRTUS rr 11e.2oe, 


The S4&8DL translation is 


RETRIEVE ((FILE = 5) A (STATES |= 2005-2. 


One predicate of the farm (attribute = max_value) is 
created. The ABDL request will contain a single 


conjunction. 
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Peeeoe teu 1 ONS WITH SET MEMBERSHIP OGFERATIONS ON 
MUTI Pee RELATIONS 

In the preceding chepter, we have describded Stil SELECT 
queries which utilize the comparison operators, [N. ANY, and 
ALL in the WHERE clause. These are simple, single-reiation 
queries in which the associated sets are enumerated. Wo now 
discuss the nested S@L SELECT queries tor nested mapping) in 
which the result of one mapping 1s used in the WHERE clause 
of another mapping. In other words, the membersnip of the 
eet following IN, £ANY, Or ALL in one SELECT operation 15 
determined by the result set of another SELECT. We iwili 
describe the operation of two-ievei, three-level and n-ievel 
nested SELECTs in Sections A, Ete and C, respectiveiy. im 
aeetion 0D, we show how the nested S@L SELECT is transiated 


Intao 2a series of ABDL RETRIEVES. 


freee VESTED SELECTIONS WITH TWO RELATIONS 

AS previously stated, In a nested SQL SELECT, the 
results of one SELECT operation are used in the WHERE ciause 
Of another SELECT operation. We view the former SELECT as 
the inner (level of) SELECT, and the latter as the outer 
(level of) SELECT. Figure 8 depicts an example of a two- 
level nested SELECT operation. This particular example 15 
chosen for its similarity to one of our examples in Chapter 


IV (1.@., Example 6) which utilizes the operator, <ANY, in 
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M 


on yurction with a manually enumerated set. In tne 


el 


eanqensrate case presented in that example, the operator, 
COaNY , appeared ta be of marginal usefulness. The usefulness 
ef this ana similiar operators (e.qg., *®=ANY, S=ALL) in the 
nested SELECT, will now become apparent. 

Botn our current example in Figure 8, and Example 6 of 
Chapter [VY resuit in a set of supplier numbers for suppliers 
MIitnh status value less than the current maximum status value 
im the S table. In our degenerate example, we must knaw 
(1.2@., enumerate) that that value is 30. In Our present 
exampie, we allow arm inner SELECT to obtain tne status value 
for each supplier number in the S table. Ry employing an 
imner level of SELECT, we are free from enumerating the 


Yaiues. 


Seren? at 


Auter 
FROM S 
SELEET 
WHERE STATUS <ANY 
SELECT STATUS 
Inner 
FROM Ss? 
SEREEW 


Figure @&. A Two-Level Nested SELECT 


Frocessing of the two-level nested SELECT in Figure 3& 
proceeds as follows. First, the inner SELECT retrieves all 
status values in the S&S table. The result of this SELECT 


the set (with duplicates) of status values (20,10,350,20,20;. 


The outer SELECT then selects supplier numbers FROM table S&S 
WHERE the status value 1s less than at least ane cf the 


Yalues in the apove result set. The result relation 1s 





Bee MESTED SELECTIONS WITH THREE RELATIONS 

We now cdescribe a three-level nested SELECT. We present 
an example which demonstrates the usefulness of the 
set/comparison operator IN, and of multi-level SELECTS in 
General. In the course of providing the requested data, 
this three-level SELECT chooses data from each of the three 


¢ a 


rabies whicn comprise our sample database. The request is 


rr 
IJ 


get supplier names for suppliers who supply at least one 


Bea part. The query 1s presented in Figure @. 


SELECT SNAMNE 
Dutermast 

FROM 5 
Seer ECT 

WHERE S# IN 


See 3H 


Inner 
FROM Se 
SELECT 
WHERE PH IN 
(SEEECT FH 
Innermost 
FROM F 
SELECT 
WHERE COLOR = ‘RED’?)) 


Figure 9. A Three—-Level Nested SELECT 


Fracessing aft the query in Figure 9 proceeds as 
follows. 

Step oe The innermost SELECT retrieves part numbers 
(F#) from the parts relation (FP) where the color 
of the parts 1s red. The result of this SELECT 
is) the set of part numbers (F1,F4,F4;. 

Step 2: The next SELECT retrieves supplier numbers (S#) 
from the shipments relation (SP) where F#s are 
In the result set af step i. The result of this 


SELECT is the set of supplier numbers (51,852,54;. 


Step -s:s The outermost SELECT retrieves supplier names 
(SNAME) from the suppliers relation CS) where 
5#s are in the result set of step 2. The result 


relation passed to the user 1s 





€. ~NESTED SELECTIONS WITH NSREEeT Tens 

Although it seems unlikely that many users would utilize 
a nested SELECT of more than 2 or 3 levels, tne subqueries 
can be nested to any depth. The form of an n-level nested 
SELECT is shown in Figure 10. 

The SET _OPR in Figure 10 refers to the various forms of 
our comparison operators IN, ANY, and ALL. In the next 
section, we describe the translation of nested SELECTs to a 


series of AHDL RETRIEVES. Therefore, 1t 1S important that 
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SELECT sei_expr_list 
level i or 
FRO relation name _}i outermost 
SEREE | 
WHERE attribute namel SET_OFR 


(SELECT attribute _namei 


2) 
Pe 


level or 
i 


FROM relation_name_= inner 
See eer 
WHERE attribute _name2 SET GFR 


(SELECT attribute _namein-1) 
level n or 
FROM relation_name_n innermost 
SELECT 
WHERE  Vecond2 tion... . > 


Figure 10. An N-Level Nested SELECT 
we note the following information as succinctly stated in 


CRef. 11. 


"The math level is the annermost SELECT. The ist 


levei is the outermost SELECT. The sel _esxpr_list oft each 
immer SELECT, 1.©e., a SELECT lower than ievel i, contains 
a single attribute name, which 1s the same as the 


attribute name used in the qualification of the next- 
higher level SELECT. The relation names at any two leveis 
may be the same." 


Er PeeawslatinG NESTED SELECTI@NS 10 €@ SERIES OF ABDL 
RETRIEVALS 
AS shown by Macy CRef. She there ex1sts a 
straightforward mapping between the SOL SELECT operation and 
the ABDL RETRIEVE operation. We can, therefore, simulate 
the mested SELECT with a series of RETRIEVES, eacn 
succeeding operation feninG the results of the previous one. 


Thus, referring to our three-level example of Section H, the 


ABDL equivalent of the innermost S&S 


RETRIEVE (C(FILE = PF) A erie 


Tne resulting set of part numbers 


the next ABDL operation as foilows 


RETRIEVE -Ct-CPr lib eS a, ets 
‘(PILE gee or le ee th 
(CP ILE. =. See te 


The last retrieve (corresponding 


oo \ 
\—— S% 


our ample) then uses the resulti 


151,522,543; as follows: 


RETRIEVE <'%FPIee = Sr A Sse 
(XPILE -="“S)> 7 Yor — 
((F ILE) Ss) ee (Sie 


It 15S intended that the operation 


transparent to the S@L user. 


Yalues of the attribute 


returned to the user in the form 


previously described for our tf 
example of section BK. 
je have now demonstrated 


retrievals involving the nested 


nested operations may include us 


IN, ANY, and ALL. The sequenc 


5&6 


SNAME 


ELEC Tae 


= 'RED‘')) <Pi#> 


{Pi ,F4,F6s 1s tnen used in 


= Fi) Vv 
= F4)) V 
= P6))) <Sit> 


to the outermost SELECT in 


nq set of supplier numbers 


Sid) V 
S20 WY 
S4))) <SNAME = 


of our SOL interface pe 


Therefore, the resulting 


(‘Smith,Jones,Clark)> are 


of the result reiation 


hree-level nested SEL Ei 


the operation of data 


SE@eae construct. These 


e of the various forms otf 


e of actions necessary to 


translate the nested S@L SELECT to aA series af AHDL 
RETRIEVES has been described. In the next chapter, we 
present our proposais for the implementation of these 


translations. 
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ie IMPLEMENTING NESTED SELEET tas 


The loaicali process Dy whicn a nested SOL SELECT is 
translated to a series of ABDL RETRIEVES has been described. 
It 185 clear that each SELECT level, from the innermost to 
the oautermast, must be translated to an ABDL RETRIEVE. 
Then. each RETRIEVE is processed in turn, with each 
succeeding aperation utilizing the results of the previous 
RETRIEVE in the QUERY part. In Section A of this chapter, we 
present the algorithms for Puilding the ABDL QUERY. in 
Section #, a simple iterative structure for controlling the 
execution of n-level nested SELECTs 1s provided. Final lve 
im, -SDeSet ram ou, the overall software structure of our Ser 
interface will be proposed. Mote that, as we continue our 
bottom-up Investigation and Include additional SOL 
operations In our set of allowable constructs, tne 
functionality of this structure may be augmented. However. 


1t 1S expected that the software structure will remain 


PAtect: 


A. ALGORITHMS FOR BUILDING THE AHDL QUERY 


We recall that the Query part of ABDL RETRIEVES (DELETE 


and UPDATE, as well) is written in a disjunctive normal 
form. A&A QUERY may be a single conjunction or it mav be a 
disjunction of con JUNCTIONS. The number of conjunctions 


generated in the translation af nested SELECTs utilizing the 
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Various forms af IN, ANY, and ALL nas been noted in Sectians 


ut 


fee anaes 6=6CmOmtlCUvChapter 6IV. Figure il summarizes thi 
information. The figure also specifies the reiatianai 
Operators involved, as well as the source of the values to 


be used im each conjunction. 


Set Opr # Conjunctions Rel Opr Value Source 
IN n = resuit set 
NOT IN 1 ‘= resuit set 
“=ANY n v= resuit set 
< =ANY 1 = max (result set? 
Z-=ANY 1 = mintresult set) 
< ANY 1 ee max (result set: 
ANY 1 x miniresult set? 
.=ALL 1 C= minfresuit set) 
F=ALL f s= max (result set) 
“ALL 1 - minéresuit set? 
FALL 1 e max (result set? 


Figure li. Summary of Nested SELECT 
Set Comparison Operators 


From Figure il, it 1S clear that our translator must 


perform a multiway selection depending upon which set 


comparison operator is utilized at each SELECT level. We 
describe an appropriate algorithm in Subsection 1. It can 
also be seen that, in the case of the operators IN and 


aay 


=ANY, a number of conjuNctTiO0NS are generated, one for eacn 
value aan the result set of the previous operation. ig 
Suosection 2, we present an n-conjunction aigorithm toa 
handle these two cases. Note that in all remaining cases, a 
single canjunction 15 generated. The 1-con JuUneGrsNes 
algorithm is presented in Subsection =. 
i. The Query-Constructor Subroutine 

AS noted above, the top-level transiator portion of 
our SQL interface must determine from the set comparison 
operator tne proper algorithm for constructing the QUERY 


part of the resultant ABDL request. This can be handied by 


a multi-way selection or CASE construct, as shawn in tne 


Query-Constructor Algorithm in Figure i2. The parameters 
passea to Query Constructar are Query Template Ca 
conjunction, described in Subsection 2, constructed to 


facilitate the incorporation at succeeding result sets), the 
Result Set of the previous request, and the appropriate 
Set Opr from Figure 11. 

In each alternative of the CASE statement of Figure 
Le, tne carrect relationai operator 1s chosen, and either 
the n-conjunction or the 1-conjunction subroutine is calied. 
The parameters provided for each subroutine call are tne 
relational operator and the result set of the previous 
operation, or the maximum/minimum value of the result set. 
As previously discussed, when ANY and ALL are used with 


these relational operators, only one value of the resuit set 
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Subroutine Guery Constructor iQuery_Tempiate,Result set, 
Set Opr) 
CASE Set Opr OF 


IN: cel eta = 
Cail MN conjunctioni@Query_Template,Kesuit Set, 
Rel Opr); 


feiss Rel "War <-- =" 
Gali One conjunction t@uery_Template,Result Set, 
Rel Opr); 


“=ANY: Poo = ——- 9 =” 
Call N_ conjunction (@uery _Template,Result Set, 
Rel Opr?; 
<=ANY: neIeOor Sf- <=" 


call One_conjunction (Query Template, 
max (Result Set),Rel Opr): 


P=ANY: Rel JUpte a—=  2=" 
call One_conjunction(Query Template, 
min(Resuit Set) ,Rel Opr); 


ANY: hoes = 
call One_conjunction(Query Template, 
max (Result Set) ,Rel Qpr); 


ANY: Rei 2 Bone-—— >” 
cali One_conjunctioni€Cuery Templates 
min(Resuit Set) ,Rel Opr?; 


<=ALL: Relooor: =—— <=" 
cali OQne_conjunction (Query Template, 
min(Result Set) ,Rel OUpr); 


2=ALL: hero Gpr <.--— ee=" 
call One _conjunction (Query Template, 
max (Resuit Set) ,Rel OQpr?); 


pelts Relbvipre*—--— “= 
call One_conjunction(Query Template, 
mintRKesult Set) ,RKel Opr?); 
2ALL: Rel So Smae—— 
Call One_conjunctiont(@Query Template, 


max (Result Set) ,Rel OQpr); 
END CASE 


END Query Constructor 


Figure i2. The @Query_ Constructor Subroutine 


wl 


1S utilized in the translation. Depending upon which form 
of the set comparison operator 16 used, the selected value 
wili be either the maximum or the minimum value in the 
result set. Therefore. a call to a standard Max or Min 
function, as appropriate, must be made prior to sending the 
resultant single vaiue to the i-conjunction subreutine. ine 
should be noted that the i-conjunction subroutine 1s calied 
In the case of the operator NOT IN. However , there 1s no 
need to utilize a Max/Min function. We also note that a 
cali tO Max/Min is never needed prior to a cali to the n- 


conaurction subroutine. 


ae The N-Coniunction Suoroutine 
In the case of the set operators IN and “=ANY. the 
above Buery-Constructor subroutine will call the n= 
conjunction subroutine. In the process of translatinag 
nested SELECTS which utilize these operators, one 


conjunction of the form 


((FILE = Relname) A (Attrname Rel _opr Value) ) 


wili be generated for each value in the result set. These 
conjunctions are ORed to form a disjunction of conjunctions, 
as explained in Chapter <IV, Sections D and =€eE. An 
algorithmic representation of the n-conjunction generation 
subroutine 1S provided in Figure 153. 


The template, defined in Figure 153, 1S provided by 


the top-level translator as it translates each SELECT level 


ce am AbD RETRIEVE. Value of Template is the only variable 
which requires substitution. For tne innermost (nth ievel) 
SELECT of a nested SELECT request. the equivalent RETRIEVE 
can be constructed completely. However , at translation 


time, the values to be used in the query portion of the 


Subroutine N canjunction(Query_Template,Rel_opr? 


/= Guery Template: x / 
{> 1s ((FILE = Relname) A tAttrname Rel _opr Valued) #/ 
/*® Duery: &/ 
{*% 1s Query _Template V Query_Template V .. . af 
f* VY Query _ Template ef 
/% =/ 
/*® For every value in the Result set #/ 
/* generate one conjunction using Tempiate ¥/ 
/*# tnen OR-concatenate into @uery. #/ 
Rel opr_of Template «<-- Rel _opr 


if Result set is NOT EMPTY 








then 
Value _of Template <-- ist value from Result set 
Guery i-- Query _Template /* Relname & Attrname * / 
/* filled in %/ 
while more values in Result set do 
Value_of Template <-- next vaiue from Result set 
Query «-- Query i: ° V’ ii Template 
end while 
else 
Gueny <=— ~ 7/* Guery 15 nil #/ 


END N conjunction 


Fiqure 1. The N-conjunction Subroutine 


remaining m-1i SELECTS are unknown. Therefore, the template 
1S provided to the N-conjunction generator which fills in 
the. missing values and constructs the QUERY part of each 


morn lEVE. 


= The 1=-Coniunction Stibeeteimee 

in the case of the operator NOT IN and ail of the 
ANY ALL operators containing «=, -=, «4, or +, the (Gees 
statement causes a cail to the i-conjunction subroutine. AS 
described in Chapter IV. one predicate of the form 
(Attribute Rel _ apr Vaiue) 1S generated for each vaiue in the 
recuilt set. These predicates are then ANDed to form a 
single conjunctian. An algorithmic representation of the i- 


conjumctiaon subroutine 15 provided in Figure 14. 


Subroutine OQne_conjunction (Query _Template,Result set, 


Rel opr) 
/* Query Tempiate: */ 
/* 1s ((FILE = Reiname) A (Attrname Rel_opr Valued) #7 
/* Fredicate: =/ 
/* is ‘tAttrname Rel opr Value? af 
f/* Query: 7 
/* 1S Query _Template A Predicate “\ .. . *s 
is A. Predicate */ 


Strip right paren from Query _Template 
Keil opr_of Tempiate <-- Rel_opr 


1f Result _set 1s NOT EMNFTY 








then 
Value of Template «-- ist value from Resulit_set 
Query <-- Querv_Template 
while more values in Result _set do 
Value_of Predicate <-- next value from Resuit set 
Buery <-- Query i: A’ 34 Fredicate 
end while 
else 
Hiery <-—— " /* DBuery 15 mil */ 
Buery Tempiate <-- Query_Template i: ‘°*)° 


Figure 14. The i-conjunction Subroutine 
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Mote. in Fiqure i4, that the template provided to the t- 
conjunction subroutine 1s identicai to that used in the WN- 
conjunction subroutine. An additional data structure, 
Fredicate is defined as tAttrname Rel_opr Value?. The use 
of this additional ‘template’ aliows us to extend the singie 


conjunction, 


({FILE = Relname: A (Attrname Rel opr Value?) 


to the multiple-predicate singie conjunction, 


(CFILE = Relnamed) A (CAttrname Rel opr Valued A. . 


A (Attrname Rel opr Value)) 


The number of predicates generated is determined by the 
mumber of values in the Result _set. 
B. AN ITERATIVE STRUCTURE FOR CONTROLLING THE EXECUTION 

DF ONeSREVble SELECT LENS 

In the previous section, we have presented algorithms 
for building the QUERY part of each ABDL RETRIEVE generated 
im the transitation of a nested S@L SELECT. We now consider 
the process of controlling the execution of this process. An 
algorithmic representation of a simple structure for the 
control of this iterative process 1S provided in Figure 15. 
This WN level Select subroutine is called by the Top-level 
process of the interface (described in Section OC). The 
Parameters passed include a series of AKBRDL RETRIEVE requests 


fin the form of a request stack), and the number, n, of such 


requests. We recall, from Chapter VV. that tne innermost 
SELECT level 15 viewed as the nth-level. Request Stack has 
the ABDL translation af the nth-ltevel SELECT on ton. The 
ist—-level SELECT 1s on the bottom. The stack 1s formed in 
this order because the nth-level request 1s the only request 
containing a fully formed query_part fas described in 
Chapter VV). Each of the other n-i requests requires tne 
Result set oaf the immediately preceding request betore it 


can be sent to MDES for processing. 


Subroutine N_ level SelectiRequest Stack ,n) 


/* Request Stack has the ABDL translation af the #/ 
/® HERS) evel PSeeel lon roe. The ist—-level  SEBeeT eae 
/* 15 on the bottom. Each request in the Stack 15 oy 
/* composed of the reserved word RETRIEVE, Target List, */ 
/* Set Opr, and Query_Part. The Query Part of the 7 
7/* nth—-level SELECT 1s fully tosmea. The Query Fart ef 
/* of the n-1i --> ist-levei SELECTS 1s a query template #*/ 
/* having the form ae 
i ((FILE = Reiname) A (Attrname Rel _opr Value) ) “/ 
/® with a blank in the ‘Value’ position. */ 
Furrent Request «-- Fop (Request Stack) 


Send (Current Request) 
Recieve (Result Set) 


COV Te ==> 1 slOnn— iso 
Current Request <-- Popikequest Stack) 
Call Query _Constructer (Query_FPart,Resuit Set,Set_Opr) 
Send (Current Request) 
Receive (Result Set) 
end far 
Display ‘Result Set) 





end N level Seiect 


Figure 15. An Iterative Frocess for Controlling 
the execution of N-level SELECTS 
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The operation of the N_levei Select subroutine 1s as 
follows. The nth-level request is popped oft the top of 
Request Stack and becomes tne Current Request rnhis 


Current Reaquest is forwarded to MDBS thraugh tre Send 





function. Upon completion of processing, the Result set is 
ebtained through the Receive function. The remaining n-l 
requests are popped off the stack and processed in ecrder. 
The nth and succeeding result sets are incorporated unto 
each request through a call to @Quervyv—-Constructar (descripned 
in Section A). The Send and Receive functions are used on 
each iteration to route request/result traffic between 
NM jievel Select and MD#HS. When the last request has 
completed processing, the final result set is provided to 
the user through a cali to the Display subroutine. Diced ay 
presents the results of the original nested SOL SELECT as 2 


result relation ‘(this is the format expected by a S@L user?. 


fee fT ROUPOSED SOFTWARE STRUCTURE 

In this section, we present a software structure for the 
implementation of nested seiections in our proposed S@L 
interface. In tact, all of the translations heretofore 
introduced in this thesis and in Macy Cfef. GJ, are 
supported by this structure. Therefore, allowing for 
possible modifications required to support additicnal 
multipie and single-relation SOL operations, the software 
structure depicted in Figure 15 represents the oaverail 


software structure of the S@L interface. 


C7 


As depicted in Figure 16, tne S@L interface is comprised 


of a singie top-level process with multiple subroutines and 


functions. The toaop-levei pracess 18s called S@LI (SOL 
Interface). We have described the N_ level Select subroutine 
SOL I 





Get SQL SQL T Send Receive Display 
Gueryv 


Buery Constructor 





PlgGire 71a. The Fraposed Software Structure 
group. We discuss the remaining subroutines as we explain 
the functionality Yet sea An algorithm for S@LI 15 


presented in Figure 1/7. 
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Tne operation of S@LI is an foliows. Once a session 1s 
initiated mom the user terminal, the actions depictecad in 
ALGORITHM S@LI are repeated until session termination. The 
SQL query toa be translated into the equivalent ABDL 
eenstruct is obtained through a cail to the subdroutine 


Get SML Query. This subroutine polls the user terminai tor 


ALGORITHM SLI 


Repeat 
CALL Get S@L Query tOQuery) 
CALL S@LT (Guery,Reqguest Stack,N,Errors) 
if N = 0 then fe OVEeaAt Enh ers */ 
CALL Display (Query) 
CALL Bisplay (Errors) 
else if N = 1 then /* Single Request */ 
Send (Fop (Request Stack) ) 
Receive (RFesuit Set) 
CALL Display (Resuit Set) 





eise |. /*® N-level Request */ 
CALL N_level Select (Request Stack,N) 
end if 


End of session”? 
until end of session 
end ALGORITHM SO@Lt 


Figure 17. The Top-level Frocess of the Intertace, SULI 


input. Note that when a query 1S obtained, the poilina 
stops until the result relation is received by the user (‘or 
syntax errors are displayed for the user). This restrictian 
1s placed in order to preclude the complexity of processing 
more than one request at a time. (le assume that several 
user terminals have access to a copy of S@LI, and that each 
user makes a request and waits for aresult pefore making 


another request). 


To 


The guery abtained by the caill to Get SQL Query 15s 
passed as a Parameter in acall to the S@L Translator 
(SGLT) supreautines S@LT parses the query, recognizes the 
Ruery-type, checks for syntax errors, and translates the SOL 
query to the appropriate AHPL request. If there are ne 
syntax errors, S@LT places the transiated requests in a 
stack and returns this Request Stack, along with the number, 
MN, of requests in the stack. In the case of simple, singie- 
relation cperations, Request Stack contains one request. in 
the case of a nested selectian, S@LT first parses and 
translates the outermost SELECT placing the resuitant 
RETRIEVE request on the stack. AS previously discussed, the 
request contains a query-template. (Fecall that only the 
mth-level., or innermost, reaquest is fully formed). If there 
are syntax erroars, SQLT returns a value af zero far WN. The 
errors are also returned. 

If the number of requests in Request Stack 16 zero iN = 
O?>, then S@LT has detected syntax errors. In this case, 
SQLI makes two calls to the Display subroutine in order to 
provide tne user a display of the query and of the errers 
detected. If the number of requests in Request Stack 15 one 
(M = 1), then the single request is popped off the stack and 
forwarded, via the Send function, to MDES for processing-= 
The Result set 1s obtained through the Receive function. 


The result relation is provided to the user through a call 


to the Display subroutine. If the number of requests i1n 


8O 


Request Stack is greater than one, then N_level Select 15 
Ccalied. The subsequent processing 15 explained in Section 
eo 

As previously discussed, we propose that the Sé@L 
interface be implemented such that S@LI and 1ts subroutines 
are resident on a host computer. This precludes the need to 
place an additional workload on the MDBS Controlier. In 
effect, MDSS is "unaware" that the user 15 making database 
requests if oGL , and the user need oniy know wnat 
information 1s desired and how to form the request in the 
syntax of S@L. The logical structure of the system i5 


Gepictec in Figure 18. 


in 
host 
computer 


in the 
backends and 
their controller 





BLreure 13. The Logical Structure of the System 
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Vi ADDITIONAL SBL-TO-ABDL TRANSLATIONS 


We have described single-relation set membership and 
multipie-relation nested S@L SELECT aperations. For each 
SQL operation, we have developed the appropriate AKHRDL 
translation. In Chapter VI, we have proposed a software 
structure to facilitate the implementation of these 
transiations, In addition to the simple, singie-relation 
translations which Macy CRef. 8] has provided. In this 
chapter, we investigate other selected single-relaticn and 
multipie-relation S@L operations. Inclusion otf these highly 
desirable options in the S@L set operations supported by the 
interface further demonstrates the power of ABDL to support 
relational operations. 

AS in previous chapters, the approach of this chapter is 
to describe each SOL operation and then determine which ABRDL 
constructs can be used to support the operation. AS each 
transitation is develaped, we show graphically, 
algorithmically, and through text how the software structure 
of the interface (described in Chapter VID must evolve 1n 
order to accomodate the additional operations. The single- 
relation operations are presented in Section A, and the 
multiple-relation operations are presented in Section B. In 
Section C, we present the modified software structure of the 


SOL interface. 


A. SELECTED SINGLE-RELATION OPERATIONS 
The single-relation operations selected for discussion 
in this section include: updating muitiple attributes ina 
single record; retrieving groups of attributes which satisty 
a2 group condition; retrieving computed values; providing 
format options; retrievina ordered attributes CSOT: and 
eliminating duplicates (PROJECTION). These aperations are 
commonly supperted in commercial relational database systems 
utilizing the S@L language. © SOL-trained user of the 
interface croposed in this thesis would expect to be able ta 
utilize familiar S@L constructs to perform these operations. 
We address the SQ@L-TQO-ABDL translations in the followina 
subsectians. 
Eee Updating Multiple-Attributes 
All data languages provide a data update capability. 
OF ainmterest here 15 the S@L construct for update. This 
construct allows the user to change the values of any number 
of attributes stored in the record by iussuing a single 
query. This capability 1s both convenient and efficient. 
The following example depicts the updating of muitiplie- 
attributes (fields) In a Single record. If we wish to 
change the coior of part P2 to yellow, increase its weight 
by 3, and set its city to Normandy, we may use the following 


SQL query: 


UPDATE e 


Sew COLOR = ‘Yellow’, 
WETGHT = WEIGHT + 3, 
CITry = “Normandy ’ 
WHERE P# = ‘PO’ 


In this example, we are updating the attributes COLOR, 
WEIGHT, and CITY in a single record with primary key, Pas 
The record 15 contained in the Farts (Ff) relation. Note 
that any reference to an attribute on the right-hand side of 
an equals sign refers to the value of that attribute prior 
to updating. 

In studying the S@L example above, we note that 
there are three cases to consider depending on the 
attributes listed in the SET and WHERE clauses. We refer to 
these as case-0, case-1, and case-2 updates. To facilitate 
the following explanation, let S be the set of distinct 


attribute names listed in the SET clause, and W be the set 


of distinct attribute names listed in the WHERE clause. In 
case-9 updates (e.g., the above example). no attribute 15 
listed in both the SET and WHERE clauses (i.e., SfiW= 0). 


In case-1l updates, one attribute 1s listed in both ciauses 
(l1.e., cardinality(S (1) W) = 1). In case-2 updates, multiple 
attributes are listed ri both clauses 
(i,e., cardinalityv(S (a A case-1i modification of 


Our example is as follows: 
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UPDATE = 


Sey COLOR = ‘Yellow’, 
WETGHT = WETGHT + 3S, 
ey = ‘Normandy’ 
WHERE (P# = °P2') AND CCITY = ‘Faris ) 
Note, ie is in both S and W, and the cardinality of 
foe, ) We is tit. A case-2 modification of our original 


example is as follows: 


UPDATE P 


SET COLOR = ‘Yellow’, 
WEIGHT = WEIGHT + 3S, 
CITY = ‘Normandy’ 
WHERE (P# = °P2’) AND (CITY = ‘Paris’) 
AND (COLOR = ‘GREEN’) 
Note, ely and COLOR are in both S&S and W, and 


cardinality(S (\W) ? 1. The S@L-to-ABDL transiations of the 
three cases of multiple-attribute update are described iin 
the following subsection. 
Aa. The translation to ABDL 

ABDL does not provide a single-request construct 
which updates more than one attribute in a record. We must 
translate the SQL UFDATE into multiple ABDL UPDATEs. Case-G 
SQL UPDATE queries can be translated directly to multiple 
ARBRDL UPDATE requests. The order in which these requests are 


Processed is immaterial. The case-O example above 


transiates to the foliowing three independent ABDL UPDATE 


requests: 


UPDATE ‘iCFILE = FP) A (P# = F2)) (COLOR = Yeliow) 
UPDATE (C(FILE = Pd) A (PR = FP2)) (WEIGHT = WEIGHT + 3) 


UPDATE ((FILE = F) A (P# 


P2)) (CITY = Normandy) 


Dur case-i example translates to the same three 
UPDATE requests, however, the presence of tne CITY attribute 
im beth the WHERE and SET clauses effects the structure aft 
the translation. The order of request processing now 
becomes important. For examole, if CITY 1S updated first, 
the condition ((P# = °P2°*) AND (CITY = “Paris’)) is no 
longer satisfied when a subsequent attempt 1s made to 
process the COLOR and WEIGHT UPDATE requests. ARDL provides 
A construct called a Transaction which specifies the order 
in which a series of requests must be processed. Therefore, 


the case-1i translation becomes 


REGIN Transaction 


UPDAME. ({F ILE 


FPF) A (CFP# = F2)) CCOLOR = Yellow) 


UPDATE ((FILE FP) A (P# = P2)) (WEIGHT = WEIGHT + 3) 


BE DaAte) ( (rie P) A (FH# = FP2)) (CITY = Normandy?) 


END Transaction 


Requests within a transaction are processed in the same 
order as they are specified. Therefore, we can obtain a 


correct result. 
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The case-2 example aiso translates to a series 
ot three ARBRDL requests. However , the translation 1S more 
comple™. In this case, multiple attributes specified in the 
WHERE clause are also listed in the SET clause. When the 
first of these attributes is updated, all subsequent 
attempts to update the remaining attributes will fail. 
Since the WHERE condition is no longer’ satisfiabie, the 
record can not be found. The following sequence of AHDL 
requests accomplishes the requested update. (Note that the 
ABDL UPDATE construct is not used). 

RETRIEVE (CFILE = PF) A (P# = P2)) <FR,PNAMNE,COLOR,WEIGHT, 
eaiy.> 
DELETE ((FILE = P) A (P# = P2) A (PNAME = Bolt) A 
(COLOR = Green) A (WEIGHT = 17) A 
(CITY = Paris)) 
INSERT (<FILE = FSy,tFH = P2>,<PNAME = Bolt:, 
“COLOR = Yellow?,< WEIGHT = 222,<CITY = Normandy+=,:} 
Ds A proposed Software Structure 

In order to implement multiple-attribute 
updates, we must augment the functionality of the software 
Structure (SGLI) which we have developed in Chapter Vi. We 
specify an additional parameter, Request Type, to be 
returned by S@Q_T. When the value of Request Type is 
‘CaseO_update’, the subroutine CaseO update is called. In 
this case, the multiple ABDL RETRIEVE requests are simply 
removed from Request Stack and forwarded to MDBS- for 


processing. As previously stated, the order of processing 


= 


does not effect the result. When ali updates are complete. 
the user 1s so informed. When the value of Request Type 1s 
‘Casel_ update’, the subroutine, Caseil update is called. 
Wnen the value of Request Type 15 “Case? update’ .the 
subroutine, Case? update 1s called. The Casel_update and 
Case? update subroutines are presented in Figures 19 and 20, 


respectively. 


Subroutine Casel update (Request Stack,Result Set) 


/* Transaction Request: ef 
Te 1s a tempiate with the Reserved word BOT e/ 
/* followed by multiple blank lines (to be used ¥/ 
/ * by the series of requests) and the Reserved HG 
{/ * word EQT. ¥/ 


while NOT EMPTY tRequest Stack) do 
Fop (Request Stack) 
Fill in blank lines of Transaction Request with 
requests from Request Stack 
end while 
Sendt Transaction Request) 
Receive (Result Set) /*® Result Set returned to % / 
f/* calling routine ee 
end Casei_update 


Figure 19. Subroutine Casel Update 


The Caseli Update subroutine builds a transaction 
of update requests for MDBS processing. The subroutine 1s 
provided the parameter Request Stack which contains multiple 
UFDATE requests stacked such that the request on the bottom 


of the stack 1s the request which must be processed last. 
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Subroutine Casei Undcate sends the request transaction to 
MDBS, Receives the Resuit_Set, and returns the Resuit_ Set ta 


the calling routine. 


Subroutine Case? update (Request Stack,Result Set) 


/* Insert_Template: =/ 
i 15 the INSERT request with values for the &/ 
/* attributes-to-be-updated and bianks for the = / 
ce attributes whose values are obtained by the 7 
f* RETRIEVE request.  / 
Send tFop (Request Stack)) f/® RETRIEVE request wf 
Receive (Result _Set) 
Send (Pop tRequest Stack)) /*® DELETE the record ete 
Receive(Result Set) /* deletion 15 compiete “/ 
While there are records to update do 
Insert Template <-- /* 111 1n Blanks with retrieved * f/f 
/* attribute vaiues ee 
Insert Request <-- /* form the INSERT request from ee 
/® the record and Insert_Tempiate */ 


Send(Insert Request) 
end while 
Receive tResult Set) /* INSERT 1s compliete ef 
end Case? update 


ne 


Figure 20. Subroutine tCase2 update 


The Case2 Update subroutine controls che 
execution of the RETRIEVE-DELETE-INSERT series of reguests. 
The RETRIEVE obtains a copy of the pte a: Wate recordits). 
The DELETE deletes the original record(s) in the database. 
The INSERT re-inserts the record(s) with all the modified 
attribute values. 

<=. Retrieving Qualified Groups 
Roth SQL and ABDL provide an option whereby 


retrieved attributes may be grouped. For example, if we 


a? 


wish to obtain the part number and the total quantity for 
each part supplied, we may utilize the following SELECT 


construct: 


SELECT PHReSereeTY? 
FROM he 


GROUP RY P# 


The result relation iss: 


600 
1000 


406 
JOO 
BOO 
eye! 





Note that "“...each expression in the SELECT clause must be 
Single-valued for each group; that 1s, 1t can be either the 
GROUP _ BY field itself, or a fumction such as SUM that 
Operates on all values of a given field within a group and 
reduces those values to a single value.” (Ref. 7] 

The above S@L operation is directly supported by the 
software structure oof Chapter VI. Using the SELECTsea— 
RETRIEVE mapping which we have described in Chapter III, the 


equivalent ABDL construct is: 


RETRIEVE (FILE = SRI=PHS SUN ei ee ee 


S@QL provides a further option for use with grouped 
attributes. Once the rows of atable are grouped by a 


selected attribute, groups not satisfying a specified 
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eondition can be eliminated through the use of the HAVING 
operator. The following comprehensive example clarities the 
use of the “GROUP BY with HAVING’ option. It we wish to 
obtain the part number and tne maximum quantity of the part 
supplied for all parts such that the total quantity supplied 
1S greater than 300 (excluding from the total all shinoments 
for which the quantity is less than or equal to 200), we may 


use the following query: 


SELECT P#,MAX (COTY) 
FROM Se 

WHERE OLY = 200 
GROUP BY P# 


HAVING SUM(@QTY) > 300 


We can imagine the resuit relation 





being formed as follows. A copy 1s made of table SF (FROM. 
The rows not satisfying "QTY = 200" are eliminated (WHERE). 
The remaining rows are then grouped by P#H (GROUP BY). The 
newly formed groups are checked against the predicate 
"SUM (QTY) SON, Those not satisfying the condition are 
eliminated CHAVING). Finally, part numbers and maximum 


quantities are extracted from the remaining groups (SELECHO 


! 


a. The Translation to ABDL 
== previously discussed, ABDL provides a 
construct for the retrieval of data which 15 qrouped by a 
selected attribute. In the comprehensive S@L example above, 
the use of the HAVING operator specifies a further 
Gualification on the groups. In this example, the groups 


whose total quantity supplied is less than or equal to 200 


are to be eliminated. ABRDL does not provide a facility tor 
checking this group condition. This condition must be 
checked in the interface. The SOL query is translated to 


the ABDL request 


RETRIEVE (({FILE = SP) A (@TY > 200)) <P#,MAX (QTY) ,SUM(QTY) > 
BY PH 


which we imagine returns the following table: 


MAX (aTY) | SUMCQTY) 








Software ain the interface then checks the HAVING condition 
“SUM GT YS 30G". This eliminates the grouping for part P4. 
The remaining part numbers and maximum quantities are 


returned to the user. 


Ds A Froposed Software Structure 

When S@LT returns the value, Oreavipepy having 
for the parameter, Request Type, we assume that the HAVING 
condition is also made available to the Group-By-Having 
subroutine. (We make asimilar assumption for other 
Request Types). The subroutine sends the request, receives 
the result set, checks the HAVING condition, and returns 
only those tuples satisfying the having condition to the 


user. Figure 21 depicts this aperation. 


Subroutine Group—-By—-Having (Request Stack ,HAVING_ condition, 
Result Set) 
Send (Fop (Request Stack) 
Receive tResult Set) 
Eliminate groups not satisfying HAVING condition 
end Group By Having 


Figure 21. Subroutine Group By Having 


a Retrieving Computed Values 


The concept of retrieving computed values is simple, 
yet it typifies the iaumportant options that database 
management system designers are providing in order to ensure 
user-friendliness and user-flemibility. This option supports 
the inclusion of arithmetic expressions involving fields as 
well as simple field-names. For example, the user should be 
able to specify units—-of-measure for numerical results. SQL 
supports this concept. If we wish to obtain the part number 


and the weight of the part in grams (given in table F' in 


0 
L4 


rounds), we may use the following query: 


£ 


SELECT PH,WEIGHT *« 454 


FROM 


The resuit relation 18s: 





A. The Translation to ABDL 
In this translation, the ABDL request retrieves 
the aundicated attributes leaving any computation to be 
accomplished in the interface. For the example above, the 


ABRDL translation is 


RETRIEVE (FILE= FP) <P#,WEIGHT> 


The specified arithmetic operation 1S performed by interface 
software on the retrieved values for WEIGHT (1.e., WE TGH Tie 
454) prior to returning the final result relation to the 
user. The software required 1s a Simple aunterpreter “for 
evaluating arithmetic expressions. 
iD. A Proposed Software Structure 

An Expression Evaluator subroutine can be used 
to accomplish the arithmetic operations specified in the S@L 
query. The subroutine simply utilizes the appropriate 


function fe.g., Mult,Add,Sub,Div) to perform the operation. 
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4, Providing Format Cptions 


Often, the information retrieved from a database 15 


intended for use in published reports. The availability of 
formatting options can make generating these reports 
simpler. For example, while 1 18S prudent to save disk 


space hy storing the names of suppliers as values fore an 
attribute-name such as SNAME. an end-user unfamiliar with 
the database 16 psychologically more comfartabie with a 
column heading such as SUPPLIERS. In SOL queries, the 
desired format 1s indicated in the SELECT clause. nets 
example, if we wish to obtain the names of all suppliers, we 


may use the following query: 


Soult ieeoNGhic, SUPPLIERS 


FROM 3 


SUR PE IeERS 


The result relation is: 





Note that the column heading is SUPPLIERS rather than the 
field name, SNAME. 
a. The Translation to ABDL 
This translation 1s similar to that presented in 
Subsection 2 above. Information, returned from MDBES, 1s 
modified by the interface software. The S@L SELECT query is 


translated to the ABDL request 


RETRIEVE (F LLB = Soo Seine 


The results of this request are modified by the S@L 
interface (SOLI) prior to retttrning ene eae. result 
relation to the user. In this case, the column hnheadina, 
SNAME., 15 changed to the new heading, SUPPLIERS. 
b. A Proposed Software Structure 

Format options can be provided in the Dispiay 
subrautine. Any change in the form of the table heading can 
be passed at the time of the call to Display. 

a The Retrieval with Ordering (SORT) 

Generally, the resuit of a SELECT operation is not 
Quaranteed to be in any particular order. Ordering (SORT? is 
normally not accomplished in S@L queries unless specificaily 
requested by the user. This operation may be costly, and 
the additional expense 1s often unwarranted. In S@L, “aie 
user may specify ordering through the use of the ORDER BY 
Operator. As an example, if we wish to obtain supplier-— 
numbers for all suppliers providing shipments, such that the 
result is ordered by supplier-number, we may use the 


following query: 


SELECT UNIQUE St 
FROM S] 


ORDER BY S# 
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me tmesuit relation is: 





A. The Translation to ABDL 
In the translation of the above SQL query, we 
assume an ordering capability within MDBS. The development 
of this capability 1s the goal of a current thesis by Muldaur 


[Ref 15]. The ABDL request 


Beweheve Pee — Sere ote ORDER BY S# 


returns ail supplier numbers (ordered by increasing supplier 


numbers) contained in the SF file Cimcluding duplicates). 


b A Proposed Software Structure 
be assume that the ordering of selected 
attributes 15 directly supported by MDBS. Therefore, no 


augmentation of S@LI is required. 
6. An Elimination of Duplicates (PROJECTION) 


The results of a SELECT operation may contain 


duplicates. The elimination of duplicates (PROJECTION), as 
in the case of retrieval with ordering (SORT), is normally 
not accomplished in S@L queries unless specifically 
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requested. Aqain, the cost 1s high and often unwarranted. 
Am exception toa this rule 1s that duplicate rows are 
automatically eliminated in UNION operations. (UN TON 
operations are described in Section #H). 

In SOL, the elimination of duplicates may be 
specified through the use of the UNIQUE operator. AS an 
example, If we wish to obtain supplier-numbers tor all 
suppliers providing shipments such that no supplier-number 
IS )~6« «6listed)= more than once, and the result 1S ordered by 


supplier-—-number, we may use the following query: 


SELECT UNIQUE St 
FROM Sr 


ORDER RY S# 


The result relatian i116: 





This example is a modification of the example presented in 
subsection 93. Note that duplicate supplier-numbers are 
eliminated. 
a. The Translation to ABDL 
The ABDL translation for the above S@QL query is 


identical to the translation for our Subsection 3S example. 


Again, the ABDL request 


RETRIEVE (FILE = SP) <S#> Sheer ee eo 


oe 


returns ail supplier-numbers ‘ordered by increasing 
suopplier-number s) contained in the SF file (including 
duplicates). Since UNIQUE is specified in the SELEUCT clause 
of the S@L query, SQLI must check the erdered rows 
eliminating duplicate values for the S# attribute prior to 
forwarding the result relatian ta the user. Tf our example 
15 modified stuch that the ORDER BY clause is omitted, we may 
facilitate the elimination of duplicates by "forcing" a SORT 
ef the selected attribdutes. That is, the ABDL RETRIEVE 
request 1s written to include an ORDER BY specitication. 
b. A Proposed Software Structure 


When UNIQUE is specified in the S@L query, the 


Result Set from MBBS iS passed in a call zo A 
Duplicate Eliminatar subroutine. This subroutine scans and 
compares adjacent members of an ordered Result Set 
eliminating duplicate members. We assume that the 


Result Set is always ordered prior to beinq passed to 
Duplicate Eliminator. The ordering is either user-specified 


or "forced" in the S@GLT translation. 


B. SELECTED MULTIPLE-RELATION OPERATIGNS 

In this section, we discuss two additional multiple- 
relation operations which are supported by SQL: retrieval! 
using the UNION operator and retrieval specifying JOIN 
operations. These two operations and the nested SELECT 
{described in Chapter V) give SOL much of its power and 


feeexibility. The availability of query constructs which 


oo 


allow access 
enhances 


information fram 


to related datavin 


the ease with which a user can abtain the 


the database. 


multiple tables 


We investigate UNIGN 


JOIN operations in the following subsections. 


is The Fetrieval 


From set theory, 


Using UNION 


greatly 


desired 


and 


we recall that the UNION of sets @ 


and a (12. A UNION HB) is the set of all objects » such 
that ©“ 1s a member of A Or x 1s a member of &#. The formal 
predicate logic definition of A UNION B is: 
Vx OC (x Ad WV «(x B) J 
in, Sée, the UNIQN operator is used in aA query 


comprised of multiple-SELECT constructs. 


wish to obtain numbers for parts that either weigh 


we 
than iS pounds or are currently supplied by supplier S2 
both), we may use the following query: 


SEES 


FROM 


WHERE 


UNION 


Seer 


FROM 


WHERE 


P# 


P 


WETGRT > oie 


Fi 
SP 
S# = ‘S2 
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As an example, if 


more 


(ar 





From the sampie database of Chapter I, we can see that parts 
F2, FS, and P& weigh more than 16 pounds (x CA). areata a 
welaghs less than 16 pounds, however, Fl 15 currently 
Supplied by supplier S2 (x€B). Part F2 weighs more than 16 
pounds and is supplied by supplier S2 ({xE Ad) A (x C B)). 
Note that duplicate rows are eiiminated from the result of a 
UNION operation. 
Ge The Translation to ABDL 

In the S@L query above, each SELECT construct 

translates into an equivalent ABDL RETRIEVE request. In 


this example, the two ABRDL requests 


RetnreVE (FILE bo VO ote = Lo) PH: GRDER BY PF 


Berle Ve «(File Sr 7 Sot — Se <PHS GRDER BY PH 


are processed concurrently. The results are combined in 
SQ@LI, where duplicate rows are eliminated. The remaining 
raws are forwarded to the user. 
Db. A FProposed Software Structure 
When the value of Request Type is UNION, the 
translation and processing are as follows. An MDBS SORT is 
specified in the ABDL translation. A subroutine called 


UNION pops all ABDL RETRIEVE requests off of Request Stack 


HO 


and forwards them to MDBS for concurrent processing. The 
ordered result sets are merged (‘through the use of a 
standard merge function), and then passed to 
Duplicate Eliminator. Finally, the uniquely selected 
results of the UNION operation are returned to S@LI for 
display to the user. Subroutine UNION is presented in 


rl eltg@e ou. 


Subroutine UNION Request Stack,Result Set) 
while NOT EMPTY tRequest Stack) do 
Send (Pop (Request Stack) 

end while 

ReceivetiResult Seti) 

Receive (Result Set2) 

Merge tResult Seti,Result Set2) 

CALL Duplicate Eliminator (Result Set) 

end UNION 


Figure 22. Subroutine UNION 


ae The Retrieval Specifying Join Operations 

Join operations are characteristic of data languages 
intended tor use with relational databases. SQL provides 
the capability to specify implicit join, equality join, and 
inequality join operations. In an implicit join, attribute- 
Yalues in multiple tables are compared, however, the vaiues 
returned to the user are taken from only one table. 
Implicit joins can be formed through the use of the nested 


SOL SELECT constructs which we have described in Chapter V. 


In the nested SELECT, multiple tables are accessed and the 


Values of selected attributes are compared. We note that 
only values from the outermost SELECT are returned in the 
final result set. This operation results in the formation 
of an implicit join. 

Equality goin and inequality join operations are 
specified by referencing multiple tables ina single SELECT 
query. AS an example of an equality join, if for each part 
supplied we wish to obtain part numbers and names of all 


cities supplying the part, we may use the following query: 


SELECT UNIQUE F#,CITY 
FROM SsP,s 


WHERE SP.S# = S.S# 


The result relation is: 


1 
1 
1D 
2 
ay 
A 
5 
& 





Note that table-names may be used as qualifiers in the 
SELECT and WHERE clauses in order to resolve ambiguities or 
to ensure clarity. For example, the SELECT clause may be 


equivalently written 


SELECT UNIQUE SP.P#,S.CITY 


HOS 


Although there are optimization techniques which 
facilitate a more efficient implementation, we can visualize 
the join operation as follows. First the Cartesian product 
of SP and Sis fermed. Then, rows not satisfying the 
condition SP.S# = §.5# are eliminated. Next, columns F# and 
CiTY are projected from the remaining rows. Finally, since 
the operator UNIQUE 15 used, all duplicate rows are removed 
before the result relation is returned to the user. (For an 
indenth giscussion of the efficiency and optimization 
consideratians of implementing join operations, the reader 
is referred to Demurijian CRef. el) 

a. The Translatian to ABDL 

The attribute—-based data language, as 
implemented in MDBS, does not provide a join capability. 
Metldur CRef. 1513 1S currently investigating the practicality 
of incorporating join operations within MDES. Tf we assume 
that the functionality of MDBS 1s augmented to support the 
ecuality join and inequality join operations, we might use 
the following translation for the equality join (as 
discussed in Demurjian (Ref. La dae The general form of a 
Simple, two-way equality join expressed in the syntax of SQL 


is 


SELECT sel _expr_list 
FROM relation_namel, relat1ion_name2 


WHERE relation_namel.attribute = relation _mame2.atertomee 
AND qualification 
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The qenerai form af the ABDL translation 15 


SenRreve (attribute _list_1) i{iquery_1l1) 
CONNECT ON (attribute 1, attribute 2) 


(attribute_list_ 2) (query_2) 


The sel _ expr _ list of the S@L SELECT is divided into a target 
list consisting of attributes from relation_namei and a 
target list consisting of attributes from relatiaon_namez. 
The qualification of the SOL SELECT is likewise partitioned. 
The attributes named in the equaiity predicate become the 
object of the CONNECT ON clause in the ABDL request. 
Following this general =~¢ftorm, the translation for the 


equality join example of the preceding subsection 15s 


RETRIEVE < (S#,FH) (FILE = SP) > 
CONNECT ON (SP.S#, S.S#) 


< (S#, CITY) (FILE = S) > 


bd. A Proposed Software Structure 
As stated previously, we assume a 301Nn 
capability for MDBS. Therefore, mo augmentation of S@LI is 


required. 


. THE MODIFIED SOFTWARE STRUCTURE OF THE S@L INTERFACE 

In this section, we present the modified software 
Structure of SO@LI. We modify the structure which we have 
presented in Chapter VI in order to facilitate the 


implementation of the additional operations described in 
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this  cechapter= Tne modified version of the top-ievel 


process, S@LI, is shown in §igquge 2. Note, we have 
simpiified this algorithm through the use ot the 
Request Control subroutine. The functionality of this 
subroutine 1s presented in Fiaure 24. The purpose of 


Request Controi 1S to provide overall control of request 
processing for the interface. A high-level view of the 
modified software structure 1¢ shown in Figure 25, and the 
relationship between Subroutine Request Control and its 


subordinate group af subroutines is depicted in Figure 26. 


ALGORITHM S@LI (Modified) 


Repeat 
TALL Get_SOL Query (Query) 
CALL S@LT (Query,Regquest Stack,N,Errors,Request [Iype, 
Format _Option,Arith_Expr) 
if N = © then /*®-SyYNcCax Sheors 27 
CALL Display (Query) 
CALL Dispiay (Errors) 
else 
CALL Request Control (Request Stack,N,Request Type, 
Arith_Expr,Resuit_Set) 
CALL Display (Result Set,Format_Option) 
end if 
End of sessian? 
until end _of session 
end ALGORITHM S@LI (Modified) 





Figure 235. ALGORITHM S@LI (Modified) 


Subroutine Request Control (Request Stack,N,Request Type, 
Arith_Expr,Resuit_Set) 

CASE Request Type OF 

CaseO Update: CALL Caset Update (Request Stack ,Resuit Set); 


Casei Update: CALL Casei Update (Request Stack ,Result Set? 


‘ae 


Case? Update: CALL Case? Update (Request Stack,Result Set); 


Greup Having: CALL Group Having (Request Stack, 
Condition,Resuit Set); 


UN TON: CALL UNION (Request Stack,Resuit Set); 


Others: if N = 1 then 





CALL One Request (Request Stack,Result Set? 
/*® for simple, directiy-supported #/ 





/* single request = / 
else 
CALL N_ Level Select (Request Stack, 
N,Resuit Set) 
end if 


END CASE 


Fiqure 24. Subroutine Request Control 


oy 


Soba 


SOLT 





Progure 2. A High-Level View of the Software Structure 
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Cased Case2 
Update Update 





Union One 
Request 
| 


Casel Group N Level 
| Update Having Select 


Lee on ume EL ON |) +. N Conjunction 


Dupe 
Elim 


Figume 26. Request Control and its Subroutines 
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by balas CONCLUDING REMARKS 


In this thesis, we have concentrated on the lanquage 
interface aspects of using an attribute-basced database 
system, MDBS, as a kernei for the support of the relational 
data model and the relational query language, S@L. = 
related thesis by Wel1shar [CRef. 16] provides the design and 
analysis of an interface far the hierarchical model and the 
hierarchical data language, ODL/I. This work 1S part of 
ongoing research being conducted by the Laboratory for 
Database Systems Research under the direction of Dr. David 
Ea Hsiaq. As stated in [Ref. 113, the goal of this phase of 
the laboratory's research "s.e1S to provide Increased 
utility in database computers. A centralized repasitory of 
data 1S made available to multiple, dissimilar hosts. 
Furthermore, the database 1S aiso made availabie to 
transactions written in multiple, dissimilar data 
ianquages.”" 

The rapid evolution of database technology nas provided 
the motivation for this research. Commercial database 
management systems have only Deen avaliable since the 
1940's. Today, Organizations of ail types are criticaily 
dependent on the operation of these systems. This 


dependency comes from the need to centrally, eon wes large 


Lo) 


Quantities of operational data. The information must be 
accurate and readily accessible by relatively inexperienced 
end-users. 


There are three generally known approaches to the design 


of database systems. These are the network, hierarchical. 
relationai approaches. An organization normally chooses a 
commercial system based on one of these models. Tne 


database must be created and operator and user personnei 
must be trained. Because of the re-programming ana  re- 
training effort (Cand money) required, an organization is 
unlikely to cnange to a system based on ane of the oather 
models. 

We have discussed an alternative to the development of 
separate stand-alone systems for specific data modeis. In 
this proposal, the three generally known models and their 
modei-pasecd data ianguages are supported by the attridute- 
based data model and data language. We have shown (in the 
relationai case) how a software interface can be built for 
such support. 


Specific contributions of this thesis include extremeiy 


thorough xplanations of S@L operations such as: set— 
membershio, nested retrievals, retrieval of grouped 
attributes, JOin operations, retrieval of computed values, 
providing format options, retrieval using UNION, updating 


multiple fields. retrieval with ordering, and elimination of 


duplicates. We have extended the work of Macy (CRef. G1] by 


Pele 


showing tnat many of the S@L constructs for the above 
cperations are directly supportable by ABRDL and WMDBES. 
Orhers can be translated into a series of the primary and 
aggregate operations of the attribute-based system. In all 
cases, StiL—-to-ABRDL translations are provided. We have also 
proposed a software structure to facilitate the future 
implementation of the SOL interface. 

S& major design goal has been to design a SQL intertace 


to MDBS without requiring that changes be made to the MDBS 


system. Wo have shown that the complete interface can be 
implemented an a host computer. Ali transiations are 
accomplished in the SQL interface. iDBS continues to 


receive and process requests written in the syntax of ABDL. 
We have also shown that the interface can be ecesianed toa 
utilize existing ABDL constructs (either one or a series of 
AHDL requests). No changes to the ABDL syntax are required. 
We also have not proposed any changes to the syntax of SOL. 
We have designed the interface to be transparent ta the S@L 
user. The intention is that a trained S@L user need know 
nothing of tne existence ef the interface or of NDEBS. Tne 
user can log in at a system terminal, input a SQL query, and 
obtain result data in a relational format. 

In retrospect, Our unconventional bottom-up appraach to 
design seems entirely appropriate. We have built upon the 
basic subset of SG@L-to-ABDL mappings provided by Macy (CkRef. 


8], making additions to the cet as selected SQL operations 


nave heen incorperated into the interface. AS Our 
investigation begins in Chapter IV, tne ferm of the 
interface software structure 1S not clear. When tne nested 
SOL SELECT is described in Chapter V, the requirements for 
the structure begin to solidify. We become aware that an 
iterative structure is needed to controi the processing of 
series of SBDL requests. As the aigorithm, SQ@_i, is 
completed in Chapter VI, 1t 185s clear that we have developed 
the overall software structure for the S@L interface. The 
functionality of the structure 15 enhanced as additional S&L 
operations are selected. However, the qeneral structure 
remains intact. 


As an alternative to implementing the S@L (network and 


nhierarchical, as well) interface on a nost computer, the 
imterface can be Placed inside of MDHS. We have studied 
this bpessi1b2)1ty ; and recommend against euch an 
implementation. A major desiqan goal of MDBS 15 to minimize 
the role of tne controller. This qoal can not be attained 


If =6tthe |= 6controller must Support the operation of resident 
relational, network, and hierarchical interfaces. 

We have shown that the attribute-based system supports 
relationai database applications. We have provided S@L-to- 
ABDL translations for selected database operations, and we 
have proposed a software structure to facilitate 
implementation. The next step is to implement the interface 


on a host computer. In order to finally determine the 
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overail practicality of using MDES as a kernei database 
system, we must also implement the hierarchical interface 
design of Weishar CRef. LGA Additianaliy, an interface to 


support the network model must be desiqned and implemented. 


APPENDIX A: 


FORMAL SPECIFICATION OF THE ATTRIBUTE-BASED 


DATA LANGUAGE, ABDL 


The following 


language developed 
Square brackets E J 


constructs. 


Fredicate 

attribute 

attribute being_modified 
base _ attribute 


Yalue 


Conjunct 


Quer y 


Stat 
list el 


list 


Target_list 
Attrib _val_ pair 


Half record 


Record 


are 


is the BNF for the attribute-oased 


Dy Hsiao and Menon CRefs. ah 


il 


data 


and Tl 


used to indicate optional 


attribute rel op value 
char _string 

attribute 

attribute 

string 

i Number 


i Float 


(Predicate) 


© (€Conjuncet £ Predicate) 


Conjgunct 
* Query #/ Conjunct 
AVG +: MAX | MIN +: SUM + COUNT 
Stat (attribute) 
attribute 
list el 


» List,attribute 
+ List,list_el 


(list) 
fattribute,value> 


Attrib val pair 


t 


(| Half record, Attrib _val_pair 


(Half record) 
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Foimncer 
riodi1f1er 


type-O 


type-I 


type-II 


type-III 


type-IV 


Request 


Insert 
Delete 
LUipndate 


Retrieve 


wuc-~-Letter 


string 


lco-letter 


char _ string 


digit 


number 


it 


number 
type-o 

i type-I 

: type-II 
i type-IIl 
i; type-IV 


<attribute being_modified 
value> 


“attribute-being_modified 
expri> 


Sattribute being modified 
expr2> 


“attribute being modified 
expr2 of Query> 


Sattribute_being madified 
expr2 of Fointer => 


Insert : Delete : Update | 
Retrieve 


INSERT Record 
DELETE Query 
UPDATE Query Modifier 


RETRIEVE Query Target_list 
CRY attributed 
CWITH Fointer ] 


Ao EB ees ee 


uc letter 


(+ String uc_letter 


uc letter 
i char_strinmg le _ beeten 


{2:3 | 4 {ieee 
1 g 


1 
= 


“IO 


Giaaat 
i digit number 


fioat 
add op 
mult op 


expr il 


arith_terml 


arith_factorl 


expor2 


arith_term2 


arith _factor2 


number .number 
4+ i= 
& | ff 


arith_termli 
© expri add op arith termi 


ecithr factor 1 


© arith termi muit_op 
arith factorl 


attribute being modified 


; number 


arith term2 


i expr2 add op arith term2 
arith factor2 


i arith _term2 mult _op 
arith factor2 


base attribute 


i number 


Pay 
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